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Introduction 


Although the neutrosophic statistics has been 
defined since 1996, and published in the 1998 book 
Neutrosophy. / Neutrosophic Probability, Set, 
and Logic, it has not been developed since now. A 
similar fate had the neutrosophic probability that, 
except a few sporadic articles published in the 
meantime, it was barely developed in the 2013 book 


“Introduction to Neutrosophic Measure, 
Neutrosophic Integral, and Neutrosophic 
Probability”. 


Neutrosophic Statistics is an extension of the 
classical statistics,and one deals with set values 
instead of crisp values. 

In most of the classical statistics equations and 
formulas, one simply replaces several numbers by 
sets. And consequently, instead of operations with 
numbers, one uses operations with sets. One 
normally replaces the parameters that are 
indeterminate (imprecise, unsure, and even 
completely unknown). That’s why we made the 
convention that any number a that is replaced by a 
set be noted ay, meaning neutrosophic a, or 
imprecise, indeterminate a. ay can be a neighbour of 
a, can be an interval that includes a, and in general 
it can be any set that approximates a. In the worst 
scenario, ay can be unknown. In the best scenario 
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(when there is not indeterminacy related to a), an = 
a. 

Why this passage from crisp numbers to sets? 
Because in our real life we cannot always compute 
or provide exact values to the statistics 
characteristics, but we need to approximate them. 
This is one way to passing from classical to 
neutrosophic statistics, but other ways could be 
possible, depending on the types of 
indeterminacies, and the reader is kindly invited to 
do such research to be published in the next issues 
of the international journal of “Neutrosophic Sets 
and Systems”, http://fs.gallup.unm.edu/NSS/. 

The author would like to thank Prof. Yoshio 
Hada, the President of Okayama University of 
Science, Prof. Valery Kroumov from Okayama 
University of Science, Prof. Akira Inoue from the 
State University of Okayama, also Prof. Masahiro 
Inuiguchi, Dr.Masayo Tsurumi, and Dr. Yoshifumi 
Kusuroku from the University of Osaka, and Dr. 
Tomoe Entani from the Hyogo University for their 
valuable considerations and opinion during my 
postdoctoral research in Japan in December 2013 
and January 2014 about applications of the 
neutrosophic science in robotics and other fields. 

Any quantity computed with some indeterminacy 
from values in a sample (i.e. not exactly) is a 
neutrosophic statistics. 

A neutrosophic statistic is a random variable and 
as such has a neutrosophic probability distribution. 
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The long-run behaviour of a neutrosophic statistic’s 
values is described when one computes this 
statistic for many different samples, each of the 
same size. 

Neutrosophic Statistics is an extension of the 
classical statistics. While in classical statistics the 
data is known, formed by crisp numbers, in 
neutrosophic statistics the data has some 
indeterminacy. 

In the neutrosophic statistics, the data may be 
ambiguous, vague, imprecise, incomplete, even 
unknown. Instead of crisp numbers used in 
classical statistics, one uses sets (that respectively 
approximate these crisp numbers) in neutrosophic 
statistics. 

Also, in neutrosophic statistics the sample size 
may not be exactly known (for example the sample 
size could be between 90 and 100; this may happen 
because, for example, the statistician is not sure 
about 10 sample individuals if they belong or not to 
the population of interest; or because the 10 sample 
individuals only partially belong to the population 
of interest, while partially they don’t belong). 

In this example, the neutrosophic sample size is 
taken as an interval n = [90, 100], instead of a crisp 
number n = 90 (or n = 100) as in classical statistics. 

Another approach would be to only partially 
consider the data provided by the 10 sample 
individuals whose membership to the population of 
interest is only partial. 
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Neutrosophic Statistics 


Neutrosophic Statistics refers to a set of data, 
such that the data or a part of it are indeterminate 
in some degree, and to methods used to analyze the 
data. 
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In Classical Statistics all data are determined; 
this is the distinction between neutrosophic 
statistics and classical statistics. 

In many cases, when indeterminacy is zero, 
neutrosophic statistics coincides with classical 
statistics. 

We can use the neutrosophic measure for 
measuring the indeterminate data. 

The neutrosophic statistical methods will enable 
us to interpret and organize the neutrosophic data 
(data that may have some indeterminacies) in order 
to reveal underlying patterns. 

There are many approaches that can be used in 
neutrosophic statistics. We present several of them 
through examples, and afterwards generalizations 
for classes of examples. Yet, the reader can invent 
new approaches as well in studying the 
neutrosophic statistics. 

We emphasize, as in neutrosophic probability, 
that indeterminacy is different from randomness. 
While classical statistics is referring to randomness 
only, neutrosophic statistics is referring to both 
randomness and especially indeterminacy. 

Neutrosophic Descriptive Statistics is 
comprised of all techniques to summarize and 
describe the  neutrosophic numerical  data's 
characteristics. 

Since neutrosophic numerical data contain 
indeterminacies, the neutrosofic line graphs, and 
neutrosophic histograms are represented in 3D- 
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spaces, instead of 2D-spaces as in classical 
statistics. The third dimension, in addition of the 
XOY Cartesian System, is that of indeterminacy (I). 
From unclear graphical data displays we can 
extract neutrosophic (unclear) information. 

Neustrosophic Inferential Statistics consists of 
methods that permit the generalization from a 
neutrosophic sampling to a population from which 
it was selected the sample. 

Neutrosophic Data is the data that contains 
some indeterminacy. 

Similarly to the classical statistics it can be 
classified as: 

- discrete neutrosophic data, if the values 
are isolated points;for example: 6 +i}, where i, € 
[0, 1], 7, 26+ i, where i; € [3,5]; 

- and continuous neutrosophic data, if the 
values form one or more intervals, for example: 
[0, 0.8] or [0.1, 1.0] (i.e. not sure which one). 

Another classification: 

- quantitative (numerical) neutrosophic 
data; for example: a number in the interval [2, 5] 
(we do not know exactly), 47, 52, 67 or 69 (we do 
not know exactly); 

- and qualitative (categorical) neutrosophic 
data; for example: blue or red (we don’t know 
exactly), white, black or green or yellow (not 
knowing exactly). 

Also, we may have: 


INTRODUCTION TO NEUTROSOPHIC STATISTICS 





- univariate neutrosophic data, i.e. neutro- 
sophic data that consists of observations on a 
neutrosophic single attribute; 

- and multivariable neutrosophic data, i.e. 
neutrosophic data that consists of observations on 
two or more attributes. 

As a particular cases we mention the bivariate 
neutrosophic data, and trivariate neutrosophic 
data. 

A Neutrosopical Statistical Number N has the 
form: 

N=d+i, 
where dis the determinate (sure) part of N, and iis 
the indeterminate (unsure) part of N. 

For example, a = 5 + i, where 
i € [0,0.4], is equivalent toa € [5, 5.4], so for sure a > 5 
(meaning that the determinate part of ais 5), while 
the indeterminate part i €[0,0.4] means the 
possibility for number ,a” to be a little bigger than 
5. 

We may consider, similarly to the classical 
statistics, a neutrosophic stem-and-leaf display of 
data. 

For example, lets have the neutrosophic data 
that follows: 

6 + 4, withi, € (0, 0.2); 
7 + i;withi; € [2,3]; 
6 + iz, withis € [0, 1]; 
9 + i4, withi4 € [1.1, 1.5); 
9+ iy. 
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Its neutrosophic stem-and-leaf display is: 








6j&à i3 
7| à 
gli i, 
or under the form of interval: 
6 (0,0.2) [0,1] 
7 [2, 3] 
9/[(0,0.2) [1.1,1.5] 








Obviously a neutrosophic statistic number can 
be written in many ways. 

If you retake: a = 5 + i, withi € [0,0.4], then 
a = 4 + į, with i € [1, 1.4], ora = 3 + iz, with 
i; € [2, 2.4], and in general a = « + ix, with 
ix € [5—«, 5.4—«], and x any real number. 

Or in opposite way: 

a = 5.4 — i4, with i, € [0,0.4], 
and in general 

a — B — ig, with ig € [B —5.4,6 — 5|, and6 any real 
number. 

A Neutrosophic Frequency Distribution is a 
table displaying the categories, frequencies, and 
relative frequencies with some indeterminacies. 
Most often, indeterminacies occur due to imprecise, 
incomplete or unknown data related to frequency. 
As a consequence, relative frequency becomes 
imprecise, incomplete, or unknown too. 

An example about the neutrosophic frequency 
distribution concerning the number of accidents by 
car drivers. 
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Number of acdidena Nes fosochi: frequency Nevircouptk creative frequercy 
H $5 (0.185, 0.227) 
i (G0, &c| (0.240, 0.333 
2 [70, 90} [0.280, 0.375) 
3 (40, $al (0154, 0217) 
Total 63 1x 1M [0859, 1.1527 





How to read the previous table, let's say line #2: 
the number of car drivers with only one accident is 
between 60 and 80 (thus unclear information), and 
corresponding  neutrosophic relative frequency 
between 0.2240 and 0.333. 

To compute the total for the neutrosophic 
frequencies, where we have imprecise information, 
we compute the min and max of estimated 
frequencies: 

minns = 50 + 60 + 70 + 40 = 220, 
andmaxnr = 50+ 80+ 90+ 50 = 270. 

To compute the neutrosophic relative frequency, 
we also do the min and max of all possibilities. 

For zero accidents: 


l 50 
MUNnr f = 270 = 0.185 


5 
andmax,,y = 220 = 0.227, 
or 50 + [220,270] « [0.185, 0.227]. 


13 
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For one accident one has: 


60 
ine = — =0240 
MNnarf = so 460 +90 +50 
80 
maxnrf = LL LLL A 0.333. 


50 4- 80 4-70 4- 40 
For two accidents one has: 


70 
TT E META 
™Marf = 504-804 70 +50 


90 
d is NE 
SM atf = E0} 60+ 90 + 40 


The interval [0.280, 0.375] is different from: 
[70,90] + [220,270] = EE ~ [0,259, 0.409]. 
For three accidents one has: 


40 
sea x054 
Turf = S04 804 90 + 40 


50 
d =— °> AY 
andmaXnrf = E0 4 60 + 70+ 50 


and similarly the interval [0.154, 0.217] is 
different from: 
40 50 
270’ 220 

We simply cumulated the neutrosophic relative 
frequencies as an addition of intervals: 

[0.185, 0.227] + [0.240, 0.333] + [0.280, 0.375] 
+ [0.154, 0.217] = [0.859, 1.152]. 


[40, 50] + [220,270] = | ~ [0.148, 0.227]. 


Neutrosophic Statistical Graphsare graphs 
that have indeterminate (unclear, vague, 
ambiguous, unknown) data or curves. 

l.a. Neutrosophic Bar Graph: 
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Table: Time spent by an American daily 
T=watching TV: between [4,5] hours; 
B=reading books: between [1,2] hours; 
D=driving: between [1,3] hours; 
S=sleeping: between [6,9] hours. 


2.a. Neutrosophic Circle Graph for the same 
example: 
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3.a. Neutrosophic Double Line Graph for the 
same example: 
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4.a. Neutrosophic Line Plot for the same 
example: 


x - one hour 


= one possible hour 
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5.a. Neutrosophic Pictograph for the same 
example: 





























Green color rectangle: one hour 
Red color rectangle: one possible hour 


6.a. Neutrosophic 2D Histogram is a neutro- 
sophic bar graph such that the bars are vertical, 
there is no gap between bars (the bars of height zero 
are also included), and the width of each bar has 
the size of the represented interval. It shows, within 
a certain interval, the approximate number of times 
data occur. 
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Neutrosophic distribution of 
family income frequencies 


Frequency of families (in millions) 





10-29 30-49 50 - 69 70-89 90-109 110 - 129 


Income (in thousand $) per year 


where — indicates in the numbering scale a 
distortion. 

The frequencies are not crisp numbers as in 
classical statistics, but between some limits. For 
example, the number of families with income in 
between $10,000 - $29,000 is between 7 and 9 
millions of families. Similarly for other classes of 
income, except for the last class of income in 
between $110,000 - $129,000 that corresponds to 
a crisp number: 1 million of families. 


19 
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We represented all types of neutrosophic 
statistical graphs in a space of dimension two (2D) 
as in classical statistics, but it is also possible to 
make the graphs in a space o dimension three (3D), 
just adding to each of the previous 2D-graphs an 
indeterminate dimension, which measures the 
indeterminacy of the data. 


1.b. The Neutrosophic 3D Bar Graph 








The deepness axis (i) measures the indeterminacy. 
For the previous example: Time spent by an American daily. 


20 


INTRODUCTION TO NEUTROSOPHIC STATISTICS 








2.b. Neutrosophic Cylinder Graph 












The heights (that represent indeterminacies) of T 
and B are the same, while the height of D is double, 
and the height of S is triple. 


3.b. TheNeutrosophic 3D-Line Graph 
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for the same example. We plot the points of 
coordinates (T, 4, 1), (B, 1, 1), (D, 1, 2), and (S, 6, 
3), where the second component represents the 
determinate part (y) and the third component the 
maximum indeterminacy (i), and connect them. We 
get a 3D curve. 


4.b. Neutrosophic 3D Plot for the same 
example: 





had 
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5.b. Neutrosophic 3D Pictograph for the same 
example: 



































6.b. Neutrosophic 3D Histogram for the same 
example of Neutrosophic distribution of family 
income frequencies: 


Neutrosophic distribution of family income frequencies 



































Frequency of families (in millions) 


oO eN 











10-29 30-49 50-69 70-89 90-109 110-129 
Income (in thousand $) per year 
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Statistical Deceptions can be expressed in the 
neutrosophic way. For example: 


a) 


b) 


c) 


"Company's heating bill went up to 1096 
last year." In a neutrosophic way we can 
write: [0, 10]% (which could be any 
number between O and 10, including the 
extremes). 

^We guarantee you lose as much as 15 
pounds in a month, or your money back.” 
Actually you lose [0, 15] pounds, so you 
may lose no pound! 

“No product is better than Brian's." This 
means that other products could be the 
same as Brian's! 
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Neutrosophic Quartiles 


Lets consider the set of  neutrosophic 
observations of a variable listed in almost ascending 
order (since we deal with sets instead of crisp 
numbers we have a partial order). 

The neutrosophic quartiles are similarly as in 
classical statistics defined: the first (lower quartile) 


is the zn +1)th, the second is the =(n +1)th, and 


the third the =(n + Dt. 


If (n + 1) is not divisible by 4, then one takes the 
average of the two neutrosophic observations whose 
ranks the quartile falls in between. Another 
procedure is to take the inferior integer part of 


=(n 1), for i = 1,2,3. 


Let's compute the midpoint of a set LI in the 


following way: 
infu - sup LI 
2 
We can define a total order on the n neutrosophic 


observation sets in the following way: 
for any sets U and V we have L«v if 
either midpoint (LI) < midpoint(V), 
Loin e (U) = midpoint(V)and min U< min V. 
If it happens that 
midpoint (LI) 2 midpoint(V) 
and min U= min V, 
then automatically max U = maxv, therefore 
LIzv. 


midpoint U= 
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An example with n = 12 ascending neutrosophic 
observations: 


1, (2,3), |{4, 6}, 5} [7, 10], |[7, 11], 9} 12,]14, [14, 15) | 20, 
{21} U (22, 25]. 
First quartile: 


1 1 
m 1)2-(2-71)- 325 
10 * D-40241 = 3.25, 


then we average the 34 and the 4th ranked 
observations: 
(4,6) tS) {4+5,6+5} (9H. P 11 


; ; 24553. 
Second quartile: 


2 2 
7 1) =-(12 +1) = 6.50 
qt) =F (12+ 1 = 650, 


then we average the 6t and 7th ranked 
observations: 
[7,11]+9 [749 114+9 
pr Ez 2 
Third quartile: 





| = 18 101 


3 3 
— 1) =-(12 + 1) = 9.75 
qt) - 0241) = 975, 


then we average the 9th and 10t ranked 
observations: 
14 + [14,15] [14-14 14-4 15 


2 2.00.02 








= [14, 14.5). 
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Neutrosophic Sample 


A Neutrosophic Sample is a chosen subset of a 
population, subset that contains some 
indeterminacy: either with respect to several of its 
individuals (that might not belong to the population 
we study, or they might only partially belong to it), 
or with respect to the subset as a whole. While the 
classical samples provide accurate information, the 
neutrosophic samples provide vague or incomplete 
information. 

By language abuse one can say that any sample 
is a neutrosophic sample, since one may consider 
their determinacy equals to zero. 

Neutrosophic Survey Results are survey results 
that contain some indeterminacy. 

A Neutrosophic Population is a population not 
well determined at the level of membership (i.e. not 
sure if some individuals belong or do not belong to 
the population). 

For example, as in the neutrosophic set, a 
generic element x belongs to the neutrosophic 
population M in the following way, x(t,i,f)EM, 
which means: x is t % in the population M, f %x is 
not in the population M, while i % the appurtenance 
of x to M is indeterminate (unknown, unclear, 
neutral: neither in the population nor outside). 

Example. Let's consider the population of a 
country Cı. Most people in this country have only 
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the citizenship of the country, therefore they belong 
100% to Cı. But there are people that have double 
citizenships, of countries C; and C2. Those people 
belong 50% to Ci, and 50% to C2. While citizens 
with triple citizenships of countries Ci, Co, and C3 
belong only 33.33% to each country. Of course, 
considering various criteria these percentages may 
differ. Also, there are countries with autonomous 
zones, whose citizens in these zones may not 
entirely consider themselves as belonging to those 
countries. 

But there is another category of people that have 
been stripped from their C; citizenship for political 
reasons and they have other citizenship, while still 
living (temporarily) in Ci;.They are called paria, and 
they do not belong to C; (not having citizenship), but 
still belong to C; (because they still living in Ci). 
They form the indeterminate part of neutrosophic 
population of country C1. 

A simple random neutrosophic sample of size 
n from a classical or neutrosophic population is a 
sample of n individuals such that at least one of 
them has some indeterminacy. 

Example. One considers a random sample of 
1,000 homes, in a city of over one million 
inhabitants, in order to investigate how many 
houses have at least a laptop. One finds out that 
600 houses have at least one laptop, 300 houses 
don't have any laptop, while 100 houses have each 
of them a single laptop, but not working. 
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Some of these 100 house owners tried to have 
their laptop fixed, others said their laptops’ hard 
drives have crashed and it is little chance to fix 
them. Therefore indeterminacy. We have a simple 
random neutrosophic sample of size 100. 

Similarly as in classical statistics, in a stratified 
random neutrosophic sampling the pollster 
groups the (classical or neutrosophic) population by 
a strata according to a classification; afterwards the 
pollster takes a random sample (of appropriate size 
according to a criterion) from each group. If there is 
some indeterminacy, we deal with neutrosophic 
sampling. 

Example. One considers two strata: men and 
women in the city of Gallup, New Mexico. But, since 
women represent 51% of the population and men 
49%, one takes a random sample of 51 women and 
a random sample of 49 men. 

But later learn that ,one" man and two ^women" 
are actually transgender. Therefore 3 individuals 
are indeterminate. Whence one has stratified 
random neutrosophic sampling. 

If the (classical or neutrosophic) population is 
divided into subgroups, such that each subgroup is 
representative of the population, and then one 
collects from these subgroups a random sample 
and there is some indeterminacy, then one has a 
neutrosophic cluster sampling. 

Example. Suppose 5 professors conduct PhD 
dissertations in neutrosophic statistics. Each 
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professor has a number of graduate students, but 
some students are undecided whether to pursue 
their dissertations in classical or neutrosophic 
statistics. The professors represent the clusters. 
One randomly selects 2 professors to interview their 
students about research in neutrosophic statistics. 
But, because some students are undecided 
(indeterminate) with respect to their research topic, 
we have a neutrosophic cluster sampling. 

A convenience sample is likely to be inaccurate 
since the pollster selects a sample of individuals 
that are readily available, who might answer 
randomly to the questions in order to finish faster. 
The less the individuals are interested in the survey 
results, the more likely inaccurate are the survey 
results. While a voluntary-response sample is 
more likely to be biased, since the sample 
individuals may volunteer in purpose to influence 
the survey results. 

Besides these two categories of sample 
individuals there is another one of malicious people 
that might oppositely answer to the questions in 
order to produce false results. 

That’s why data of some sample individuals has 
to be removed, but often we don’t know which ones. 
Therefore, we have indeterminacy related to the 
sample size: how many sample people were from the 
above three categories, and how to depict their data 
in order to remove them from the survey results? 
Again, neutrosophic statistics. 
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Neutrosophic Numerical 
Measures 


Example with Neutrosophic Numbers a+bl , 
where a, bare real numbers, and lis indeterminacy, 
such that I? = I and 0-1 =0. 


Let's have the neutrosophic numbers: 


—2—41, —1 + 0:1, 34 51,6 +71. 
Compute their mean: 


3l 


Florentin Smarandache 





(-2-4) + (-14+0-D+@G+4+5)+(6+7) | 
—— ———— 


apes BOE eder unt 
M 4 4 
Compute their median: 
(-1-0-D-(345) -1+3 0+5 


——1-1-* 2I. 
2 2 T 2 2 


Compute the deviation of each neutrosophic 
number with respect to the mean: 
(2 — 41) — (1.5 + 2I) = —3.5 — 65 
(-1 + 0-7) -— (1.5 + 21) = -2.5 — 2I, 
(3 +50 — (1.5 + 2I) = 1.5 + 3I, 
(6 +70 — (1.5 + 2I) = 4.5 + BI. 
Square the deviations: 
(—3.5 — 61)? = (—3.5)? + 2(-3.5)(-6)1 + (—6)?/? 
= 12.25 + 421 + 36I? = 12.25 + 421 + 36I 
= 12.25 + 781 
(—2.5 21)? = 625 + 141I 
(1.5 + 3D)? = 2.25 +18] 
(4.5 + 51)? = 20.25 +701. 
We are following the formula: 
(a + bI)? = a? + 2abl + b?I? 
= a? + 2abl + b?I 


°2=1.5+42I. 





or 
(a + bI)? = a? + (2ab + b?)I. 


Compute the standard deviation: 
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sz 





4 
= V 10.25 + 45I. 


To compute the square root of a neutrosophic 
number we denote the result as x+yl and 
determine x and y: 

V10.25 + 45] = x + yl. 

Raise both sides to the second power: 

10.25 + 45] = x? + (2xy + y?)I. 
Therefore: 


E [es +781) + (6.25 +141) + (2.25 + 187) + (20.25 + 701) 


f 10.25 = x? 

45 = 2xy + y? 

Since standard deviation is positive, we take 
x = +V10.25 = 3.20 

and replace it into the second equation: 
45 = 2(3.20)y + y? 

and solve for positive y: 
y? +6.4y—45=0 

whence 


—6.4 + 6.42 — 4(1)(—45) 
ay 73. 0.64. 
Therefore, the neutrosophic standard deviation 
of the previous four neutrosophic numbers is 
3.20 + 0.641. 
We observe that 3.20 is the classical standard 
deviation of the determinate parts of the previous 


neutrosophic numbers: —2,—1,3,6; but 0.64 is not 
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the classical standard deviation of the 
indeterminate parts of the previous neutrosophic 
numbers: —4,0,5,7. 

The classical standard deviation of the numbers 
-4, 0, 5, 7, whose mean is 2, is: 


e UE 


Indeterminacy has propagated when squaring 
the deviations. 


Classical Neutrosophic 
Numbers 


A classical Neutrosophic Number has the 
standard form: 
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a t bl, 
where a, b are real or complex coefficients, and I 
= indeterminacy, such 0:7 = 0 and I? = I. 
It results that I” = I for all positive integer n. 
If the coefficients a and b are real, then a+ bI is 
called Neutrosophic Real Number. 


Examples: 2+ 31, —5 + zL, etc. 


But if the coefficients a and b be are complex, 
then a+bI is called Neutrosophic Complex 
Number. 

Examples: (5+ 2i)+ (2—8i)LI--i-9I-—il, etc. 
where i = V/-1. 

A neutrosophic complex number can be better 
written as: 

à + bi t cI + dil, where a, b, c, and d are reals. 

Of course, any real number can be considered, 
by language abuse, a neutrosophic number. 

For example: 

5=5+0-], 

or 

5=5+0-i+0:/+0-i-l. 

We call it a degenerated neutrosophic number. 

A true neutrosophic number contains the 
indeterminacy J with a non-zero coefficient. 


Division of classical neutrosophic real numbers. 


(a, + bil) = (az + bl) E ? 
We denote the result by: 
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a, + byl 
a + by! 
then multiply and identify the coefficients: 
a, + byl = (x + yl)(az + b31) 
= xa; + xb4l + yazl + yb!" 
(a5x) + (box + azy + b5y)I. 
Whence we form an algebraic system of 
equations, by identifying the coefficients: 


— x t yl, 


aX =a, 
box + any + boy = bi 
or 
A2xX = 04 
box + (a + b2)y = b, 
One obtains unique solution only when the 
a, 0 
b; cages? 
Or a5(a5 + b2) + 0. Hence a; + 0 and a; + —b; are 
that conditions for the division of neutrosophic real 
numbers 


determinant of second order 








a, + bil 
az + bz! 
to exist. 
Then 
Q4 
x=— 
a2 
and 
azb, — a4b; 
JO 
az (az + b2) 
or 
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a, tb a ajb,—a,b; 
az +b,I a, @(az +b) ` 


As consequences, we have: 


GIO uS ato A for k a non-zero real 
ak*bkl  k(a+b)  k 


number, and for a + 0 and a + —b. 


I a 1 
Cab aap) ^ wp fOr a + O and a + —b. 


3. Divisions by I, -I, and in general by KI, for k 
a real, are undefined. 





























ator = undefined, for any real k, and any realsa 
and b. 
In particular: 
I 
y^ undefined; 
7I 
TF undefined; 
ae defined 
-57 7 undefined; 
a+ bI ; 
oF undefined; 
a+ bl À 
aj undefined. 
4. a+bI - UN L forc + 0; 
C C C 
c c bc 
5. aibi do Urb) I, for a + 0 and a x —b. 
6. 997.2 for b #0 (the classical division of 
b+01 b 
reals). 
7. arbor esque oMesur quat 
1 1 1 
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0 0 a:0-0-b 
mE a aum 0H TD for a+ 
0 and a + —b. 


E EE p for any real k, and a + 0 and a + 
a+bI a+b 





—b. 
Let's fo a concrete example by calculation. 
What is (2 - 3I) - (14 1) =? 

Denote: 
2 3I 
1-I 





— x t yl. 
One has: 
(1 D(xctyl-9x-yltxlt yl? 22431 
x+(x4+2y)l 22-43l. 





x=2 

Whence} . +2y =3 
or x = 2,y = 0.5. 
There 

2+ 3] 

ar 2 +0.57 
Let’s check: 

2+ 3] 

zpos Y 
Then 


(2 0.5) (x + yI) = 2 4 3l, 
2x + (2y + 0,5x + 0.5y) = 2 + 3I. 


Whence 
l 2x22 
0.5x + 2.5y = 3’ 
hence 
x=1, y=1, 
or 
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EM —14-1:121-I 
24051 — M 
Perfect. 
Another example. 
2+ 3! ET. 
Sr T? 
Whence 
8x = 2 
12x + 12y + 8y =3 
and we get 
2 Ka 1 
Eg up 
and 


1 
12 (=) + 20y =3, ory = 0. 


Therefore 
243] 1 1 
BEDI 4 574 
which is a neutrosophic simplification since: 
2-31] 31:(243I]) 1 


8-121] 4-(243]) 4 


Now an example which is undefined: 





243l 
———— z? 
1-1 

2430. ou 

qe Se 


(1— D(x * y) 22431 

x t yl — xl — yl? 2 2 31 
Or 

x-*(y-x—y) 2243I 
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or 
x—-xl=2+4+3i, 


therefore 
x=2 and -—x=3, 


which is impossible. 





Therefore 
Z2 3f 
bpm undefined. 
And an example where it results infinitely 
solutions: 
- =? 
Denote 
I 
p X t yl, 
SO 
I(x * yl) =], 
or 
xl + yl? 8 I, 
or 
(x+y)l=1-], 


whence x +y = 1, where x and y are unknown 


reals. 
We get infinitely many solutions: 


xeRandy-21-x, 
where R is the set of real numbers. Among 


solutions there are: 
1, I, 2-1, etc. 
But since the division's result should be unique, 


we say that 
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I 
I — undefined. 


Root index n > 2 of a neutrosophic real number. 


First lets compute the square root: 

Va + bI, where a, b are reals. 

Let's denote: 

và * bl — x * yl, 

where x and y are real unknowns, and raise both 
sides to the second power. One gets: 
a t bl = (x + yl)? = x? + 2xyl + y?I? = x? + 2xyl + y?l 

= x? + (2xy + y”). 


x =a 
Whence an 
Hence 5 x= iva 
y?^t2ya:y-b-0 


and we solve the second equation for y: 
F2Vat+v4a+4b F2Vat2Va+b 
a. ONE XS 
= FVatva+b, 


and the four solution are: 
(x,y) = (Va, —Va + Va + b), (Va, —Va — Va + b), 
(—Va, Va + Va + b), or(—Va, Va — Va + b). 
Thus: 


y 


Va +bI 2 Va + (—-Va + Va + b)I, 
or 
va — (Va + Va * b)I, 


Or 
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—Va + (Va + Va + b)I, 


or 


-ya + (Va - Va b)I. 


Lets consider an example done through all 
calculations: 


V9+ 7I =? 
Let’s denote: 
v9 t 7I — x * yl. 


Then: 
9 7I = x? +2xyl + y?I? = x?  (2xy + y?)1. 
Whence 
x? 2 9,0rx = +3 
| 2xy +y? =7 
Let’s find y: 
x=3 x=-3 
6y +y? =7 —6 +y? =7 
y*+6y-7=0 y?-—6y-7=0 
y*7(y-1-0 Q-)Q+1)=0 
y=-7/y=1 y=7/y=-1 
(35523 1). ane aE 

Therefore, V9 + 7I = +3 + I (four solutions). 
As a particular case we can compute vI. 
Let's consider VI = x + yl, then 

O+1:-l=x*+(2xy+y?):1 
and we need to find x and y. 
Whence x? = 0,orx = 0, 
and 2xy + y? = 1,or y? = 1,or y = +1. 
Hence VI = +I. 
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Similarly for ‘VI. 

Let's consider VI = x + yl, 

or 0 +1:1 =x" + (pz iy xk). 
wherex" = 0, or x = 0, 


and 
n-1 
2: Cynk xk = 1, 
k=0 


or y" = 1, whence y = Y1 and we get n solutions: 
a real solution y = 1 and n — 1 complex solutions in 
the case we are interested in neutrosophic complex 
solutions as roots index n of 1. 
In the same way, we can compute root index n => 
2 of any neutrosophic number: 
Va—bl 2x4 yl 
Or 
a t bl = (x t yI)* 


n-1 
=x" + (» + 2: dte) -q= 


k=0 


n-1 
=x" + (> dte) T, 
k=0 


where Ck means combination of n elements taken 
by groups of k elements. 

Whence x = Va if n is odd, or x= +¥Va if n is 
even, 

and 


n-1 ^ 
> chy ka = b, 
k=0 
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and solve it for y. 

When the x and y solutions are real, we get 
neutrosophic real solutions, and when x and y 
solutions are complex, we get neutrosophic complex 
solutions. 

Let a + bi + cI + dil be a neutrosophic complex 
number, where a, b, c, d are reals. Lets compute 
square root of it: 


(Va + bi+cl + dil) = (x + yi + zl + wil)? 
a+ bi t cl + dil 
= x? — y? t z?? + w?i?I?  2xyi + 2xzl 
+ 2xwil + 2yzil + 2ywi?I + 2zwil? 
= x? — y? + 77] — w?I + 2xyi + 2xzl 
+ 2xwil + 2yzil — 2ywI + 2zwil 
= (x? — y?) + 2xyi 
+ (z? — w? + 2xz — 2yw)I 
+ (2xw + 2yz + 2zwjil. 
Then we get a non-linear algebraic system in four 
variables (x, y, z, w) and four equations: 
x? -y =a 
2xy = b 
z? — w? + 2xz — 2yw =c¢ 
2xw + 2yz + 2zw = d. 
In a more general way, we can compute root 
indexn of a neutrosophic complex number: 


1 
(a + bi +cI + diỌl)n = x + yi + zI + wil, 
where x, y, z, w are variables in the set of real 


numbers. 
Raising to the power n in both sides, one gets: 
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a t bi t cl + dil = (x + yi + zl + wil)” 
= fio y) + fzx, y)i + faGo y, w, z)l 
+ fiGoy,w, z)il, 
where fi, f;, fa, fa are real functions. 
Whence we get a non-linear algebraic system in 
four variables (x, y, w, z) and four equations: 


Rho y) —a 
hy) =b 
fs y,w,Z) =c 


fa Go, y,w,z) =d, 
that we need to solve. 
Similarly, one can compute square root of a 
complex number. 
Let a + bi, where i = V—1, and a, b are reals, be a 
complex number. 
Va+bi=x+yi such that (x + yi)? = a + bi, 
where xand y are real numbers; 
or x? + 2xyi + y?i? 2 a + bi, 
or (x? — y?) + (2xy)i = a + bi, 
whence ed b. Í 
From the first equation x = tJ y? + ais replaced 
into the second equation: 
+2y,/y? + a = b. (RE) 
Raising both sides to the second power one gets: 
4y*(y* +a) = b*, 
or 
4y* + 4ay? — b? = 0. 
Let z = y?. Then 4z? + 4az — b? = 0, then 
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| —4a+J16a*—4-4(—b*) —4a + V16a* + 16b? 


2(4) i 8 
| -4at4Wab kb? at Var +b? 
ze - 2 : 
Then 
|-a+ Yat vj 
y= tis > 
2 
and 
b b +b 
X m mL— m OOOO Es 
2Y — (5 |-atva7 2 42a £2Na? +b? 
s 2 2 
for y + 0. 


Since (RE) is a radical equation, we need to check 
each solution of unknown y to make sure the 
solution is not extraneous. 


. -a-cva?-4 b? 
Becauseva? + b? 2 +a, the expression DOM MES 2 


0, therefore there are at least two real values for y, 
—a + Va? + b? 

2 d 
while -a — va? + b? < 0 and one has equality only 


when b = 0, resulting in y= 0. 


As a particular case, Vi = d + 5. i, or - 5 — n 


cs 


since we write: 

i=0+1-:i, whence a=0, b= 1, 

and we replace both of them into the x and y of 
previous formulas. 

We can check the results: 
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= — 2:-i —i22- |——-i 
2 4 + 2! t 2 2 +1 2 i, 
VZ vZ ) = 
—- =i) =i. 
2 2 
Lets have another example, doing all 
calculations: 


A 2 2 2., 1 1 
2 


and similarly (- 


v3—4i=? 
Denote v3—4i=x+yi. 


Then 
3 — 4i = (x + yi)? = (x? — y?) + (2xy)i. 

Whence 

ý - y? =3 

2xy = —4. 
Solve this system. 
From the second equation, y = Z and replace y 
into the first: 


or 

x? —-— -3 =0, 
or 

xt —3x?2-4=0, 
or 

(x? — 4)(x? +1) = 0, 
whence 
x?—420, 
Or 
X — 2. 
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Then 


a> ndo 
y= 


=—= F¥1. 
x +2 


Solutions: 
V3- 4i = t(2- i) 
Checking the result: 
[+(2 — D]? = 4- 4i + i? = 3 — 4i. 
Remarkably, well get the same solutions if we 
take the complex values of x and y, because: 


x? +1 = 0 gives x = +V-1 = +i, 


and replacing them into the substitution y = =a 
we get: 


Then 


V3- 4i = x + yi = ti + 2i -i = +i + 2(-1) = F2 ti 
= +(2-i). 


One generalizes this procedure and one 


computes root index n of any complex number: 
Va+bi=? 


Similarly denote: 


Va 4 bi =x + yi, 
then 
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a t bi = (x + yi)" = (yi + x)” 


n 
B 
E pi C2 y2ki2k yn-2k 
k=0 
n-1 


E 
+ > C2k*1y2len j2k+1 yn-2k-1, 
k=0 
and one obtains a non-linear algebraic system of 
degree n, in two variables x and y, and two 
equations: 


Cy 1a =a 


Carty ehtt eyed =p 
k=0 
that one solves with a computer program. 
As a particular case, let’s compute the cubic root 
of a complex number: 


Va+tbi=? 
Then: 
Va +bi =x + yi, 
or 


at bi = (x + yi)? = x? + 3x? yi + 3xy?i? + y?i? 
= (x? — 3xy?) + (3x?y — y?)!, 
whence 
x? — Sry? =a 
2 —-y3zb 
and solve for x and y. 
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From the first equation: 


Li x3—a 
ym 3x ' 


Replace this substitution into the second 
equation: 











and solve this independent equation for x with a 
calculator, and then find y from the above 
substitution. 

For example: 


Vi = -i. 
Neutrosophic Real or Complex Polynomial. 


A polynomial whose coefficients (at least one of 
them containing ] are neutrosophic numbers is 
called Neutrosophic Polynomials. 

Similarly we may have Neutrosophic Real 
Polynomials if its coefficients are neutrosophic real 
numbers, and Neutrosophic Complex 
Polynomials if its coefficients are neutrosophic 
complex numbers. 

Examples: 

P(x) =x? + (2—I)x—5+3I 
is a neutrosophic real polynomial, while 
Q(x) = 3x? + (1 + 6i)x? + 5Ix — 4il 
is a neutrosophic complex polynomial. 
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From these polynomials we proceed to solving 
Neutrosophic Real or Complex Polynomial 
Equations. 

Let’s consider the following neutrosophic real 
polynomial equation: 

6x? + (10 — Dx +31 = 0, 
and solve it just using the quadratic fromula: 
" —-(10 — I) + /(0 — D? — 4(6)(31) 


2(6) 
—10 + I + V100 — 201 + I? — 721 
= 12 
—10 + I + V100 — 201 +I — 721 
~ 12 
—10 + I + V100 — 917 


12 
Now, we need to compute V100 — 911. 
Let's denote: V100 — 91] = a + BI, 
where a, P are reals. 
Raise both sides to the second power: 
100 — 91] = a? + 2aflI + B?1? = a? + 2aßl + B?I 
= a? + (2aß + B7)I, 
whence 
f a* = 100 
2aß + B2 = —91. 
Hence a = +V100 = +10. 
1. If a=10, then 2(10)8 - p? 2 —91, or B* + 
206 + 91 = 0. Using the quadratic formula, one gets: 
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a —20 + 420? — 4(1)91 _ —20 + v400 — 364 


2 2 
-2046 
-2046 3477 
= ——_ = ( 
2 ee 
5 


2. Ifa — —10, then f? — 208 + 91 = 0, 
js 20 + /(—20)? — 4(1)91 _ 20 + V400 — 364 











2 2 
20+6 
20+6 z la 
— = ( 
2 20-6 , 
; l 


The four solutions are: 

(a, B) = (10, —7), (10, —13), (—10, 13), (—10, 7). 
We go back now and find x: 

—10 +I + V100 -9 +I 
E 12 


x 


Therefore, we previously found out that 
v100— 917 = 10-71, or —10- 7l, or 10—13I/, or 


—10 + 13I. 


Since one has + in front of the radical, 10 — 7i 
and —10 t 7I get the same values for x. Similarly, 


10 — 13] and —10 + 131. 
-10 +I (10-71 


X12 = 12 

S E E S vd 
a e a 
asc 10:621. 204-80. . 52 
pg a 
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—10 +1 + (10 — 13/) 
X1,2 = —— 


12 
-10 +1 +10- 13I _ -12I _ : 
zm 12 Bb EE. 
—10+/-10+13/ -20+141 10 7 
12 |. 12 6 6" 


We got four neutrosophic solutions 


[725-54 $n, 7, - I Z1] for 
2 3 3 6 6 

a neutrosophic real polynomial of degree 2. 

First neutrosophic factoring: 


P(x) = 6x? + (10 - Dx + 31 = e[x - (-z0)|- x 


Cio) ose) pend 


Second neutrosophic factoring: 
P(x) = 6x? + (10 — Dx + 3I 


= 6[x — (-1)]: |x z (2+1) 
7 


=6( +51)( pa 1) 
= X 2 Xx E E " 


Differently from the classical polynomial with 
real or complex coefficients, the neutrosophic 
polynomials do not have a unique factoring! 

If we check each solution, we get: 

P(x1) = P(x;) = P(x3) = P(x4) = 0. 

Let’s compute: 
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-10 7 -10 7 
» = 140 , 29 J Ap 
| (36 36 36 6 6 
PE k 7 ? 4 al 
6 6 


100 140-1 49-1 100 70-1 10-1 
+ + 








Ae gs waar a age 6 6 
7:1 , 181 

6 6 
| —1401 + 491 +701 +101 71 +181 0-1 
= - = 
-2-0 


Another procedure of factoring a neutrosophic 

real polynomial equation is the following. 

Let's have 

P(x) = (A B:Dx? - (C -D:Dx- (Ec F1) —0. 

Suppose x = a, + bl and x; = a; + bI are two 

neutrosophic real solutions of P(x) = 0. 

Then: 

P(x) = (A + B: D[x — (a, + dD] : [x — (a2 + b;1)] 
z(A-c-B:Dx? - (C -D:D)x 
+(E+F-1). 

We multiply on the second right hand side, and 

then we identify the neutrosophic coefficients, and 
solve for a4, b4, a;andb;. 
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Research Problems. 


1. In general, how many neutrosophic solutions 
has a neutrosophic real polynomial equation of 
degreen 2 1? 

So far, we know that such equation of degree 1 
has none (in the case the neutrosophic division is 
undefined) or one solution (in the case the 
neutrosophic division is well defined). 

2. How many different factorings, with factors 
of first degree, are possible for a neutrosophic real 
polynomial of degree n? We got two different 
factorings for aparticular polynomial of degree 2. 

3. - 4. Similar problems for neutrosophic 
complex polynomial equations and neutrosophic 
complex polynomials of degree n > 1. 
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Neutrosophic Random 
Numbers 


Neutrosophic Random Numbers can also be 
generated using, instead of only crisp numbers, a 
pool of sets. For example, let’s suppose one has 100 
balls and on each ball is written an interval [a, b] 
where a,b € {1,2,3,...,100}and a x b. 

When a=b we get a crisp number [a,a] =a, 
while for a « b we get a set/a, bj. 

Then randomly one extracts a ball, one registers 
its interval, then one returns it back to the pool. 
And so on. Instead of a random sequence of crisp 
numbers, we get a random sequence of intervals. 
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Example with 
Neutrosophic Data 


Let’s have the following four observations: 


6, [2, 5], 30, [18, 24]. 

The second and fourth observations are unclear, 
i.e. [2,5] means a number in this interval, but we 
don’t know which one; similarly for [18, 24]. 
Therefore we have two indeterminacies. 

In order to  uniformize let’s rewrite all 
observations as intervals: 

[6, 6], [2, 5], [30, 30], [18, 24]. 

Each observations can be a subset, not 
necessarily a crisp number a (closed, open, half 
closed - half open) interval. 

Compute the median: 

[2,5] + [30,30] [2+30,5+30] [32,35] [32 35 
2 Z 2 || 2 2' | 
= [16, 17.5]. 

Therefore the medium is a number between 16 
and 17.5. 

One computes their mean: 

[6,6] + [2, 5] + [30, 30] + [18, 24] 








4 

| [6 2 30 € 18,6 - 5 +30 +24] 
7 4 

[56,65] [56 65] _ [14,1625] 

EE PES Pal = oo 
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Therefore the average is a number between 14 
and 16.25. 
Compute the deviations and square them: 
a. [6,6] — [14, 16.2] = [6 — 
16.2,6 — 14] = [10.2, —8]; 
[10.2, -8]? = [-10.2, —8] - [-10.2, —8] 
= [(-8)(-8), (-10.2) - (-10.2)] 
— [64,104.04]. 
b. [2,5] — [14, 16.25] = 
[2 — 16.25, 5 — 14] = [—14,25, —9]; 
[714.25, —9]? = [(—9)?, (—14.25)?] = [81, 203.0625]; 
c. [30, 30] — [14,16.2] = 
[30 — 16.2, 30 — 14] = [13.8, 16]; 
[13.8, 16]? = [13.8?,16?] = [190.44, 256]; 
d. [18, 24] — [14,16.2] = 
[18 — 16.2, 24 — 14] = [1.8, 10]; 
[1.2, 10]? = [1.8?, 10?] = [3.24, 100]. 


Compute the standard deviation: 


[64, 104.04] + [81, 203.0625] + [190.44, 256] + [3.24, 100] 
4 


E E: + 81 + 190.44 + 3.24 104.04 + 203.0625 + 256 + w 8 
Sab’. =a“  — 45 — m 


= ,/[84.67, 165.775625 = 
[V84.67,V165.775625] = [9.20163, 12.8754]. 


58 


INTRODUCTION TO NEUTROSOPHIC STATISTICS 





Indeterminacy related to 
the sample size 


Suppose one has the following five observations: 


17,12,5,8,9, 
but one of them is certainly wrong, yet we don’t 
know which one. 
What to do to approximate the calculations? 
Let’s first increasable reorder the observations: 
5, 8,9,12,17, 
and then study all possibilities. 


Correct Median Deviations Squared Standard 
Pics RN Observations Deviations Deviation 


33.0625 
3.0625 
1.5625 

39.0625 








22.5625 
3.0625 

0.5625 

52.5625 
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Now we combine the five results. 
a. Interval style: 
the median belongs to the interval [8.5, 10.5]; 
the mean belongs to the interval [8.5, 11.5]; 
the standard deviation belongs to the interval [2.5, 4.43706]. 
b. Average Style: 
10.5 + 10.5 + 10.0 + 8.5 + 8.5 


5 


11.5 + 10.75 + 10.5 + 9.75 + 8.5 
themean = S A 10.2; 


themedian = 9.6; 


andstandarddeviation 
= 3.5 + 4.38035 + 4.5 + 4.43706 + 2.5 


5 
= 3.86348. 


c. Weight Average Style: 

One assigns a weight to each sample. The sample 
weight may represent the chance that the respective 
sample could be the right sample, after discarding 
the wrong observations. 

In general, the weights w4,w;, ...,w, € [0,1] such 
that 

W4 +W +e +w = 1. 

In the case when the sample weights are 
determined from criteria different from each other 
and therefore the sum of weights is not 1, and the 
observations are a,+a,+:+a,, the weight 
average is: 

W404 + W202 + °° + Wyn 
Wi +W2+--+Wy 
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In our example, if w, = 0.4,w; = 0.1,w3 = 0.3,w, = 
0.2,w, = 0.7, then: 
theweightedaveragemedian 

0.4(10.5) + 0.1(10.5) + 0.3(10.0) + 0.2(8.5) + 0.7(8.5) 
=; — Oat os40ne07= ~~ 
= 9.35294; 
theweightedaveragemean 

0.4(11.5) + 0.1(10.75) + 0.3(10.5) + 0.2(9.75) + 0.7(8.5) 


0.4 + 0.1 + 0.3 + 0.2 + 0.7 
= 9.83824 


and theweightedaveragedeviation 

0.4(3.5) + 0.1(4.38035) + 0.3(4.5) + 0.2(4.43706) + 0.7(2.5) 
= —QOATÜl403402407 — 
= 3.42673. 

According to the sample weights, it's a larger 
chance that the right sample is the fifth one. 
Therefore, the combined statistical metrics of all 
samples would be inclined to approach the fifth 
sample's statistical metrics. 

This example can be generalized for n 
observations, such that k observations among them 
are wrong, where n > 2and1<k<n-1. 

With a computer program, one studies each of 
the C"-* samples resulted after discarding k wrong 
observations, where (""* means combinations of n 
elements taken in groups of n-k elements. Each 
sample has the size n-k. For each sample one 
calculates its median, mean, deviations, standard 
deviations, and of course other statistical metrics 
required by the neutrosophic problem to solve. 
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Then we combine (”* results using either 
interval style, the average style, the weighted 
average style, or other procedures that the reader 
may design depending on the problem. 
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Neutrosophic Binomial 
Distribution 


The classical Binomial Distribution is extended 
neutrosophically. That means that there is some 
indeterminacy related to the probabilistic 
experiment. 

Suppose each trial can result in an outcome 
labeled success (S), or its mutually exclusive 
outcome labeled failure (F), or some indeterminacy 
(J). 

For example: tossing a coin on an irregular 
surface which has cracks, the coin can fall inside a 
crack on its edge, and thus one gets neither head, 
nor tale, but indeterminacy. 

We conduct a fixed number of small experiments 
(that we call trials). The outcomes of the trials are 
independent. For each trial, the chance of getting S 
is the same; similarly for the chance of getting F, or 
of getting I. 

The neutrosophic binomial random variable 
xis then defined as the number of successes when 
we perform the experiment n 2 1 times. 

The neutrosophic probability distribution of x 
is also called neutrosophic binomial probability 
distribution. 

For n trials it is important the way one defines 
the indeterminacy. 
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First, it is clear that getting indeterminacy in 
each trial means indeterminacy for the whole set of 
n trials. Secondly, getting indeterminacy in no trial 
means no indeterminacy for the whole set of ntrials. 

But what about getting indeterminacy in some 
trials, and determinacy (i.e. success or failure) in 
other trials? 

This partially indeterminate and partially 
determinate set of n trials depends on the problem 
one needs to solve and on the expert’s point of view. 

One can define an indeterminacy threshold: 

th = number of trials whose outcome is indeterminate, 
whereth € {0,1, 2,..., n}. 

The cases with a threshold > th will belong to the 
indeterminate part, while for a threshold < th they 
will belong to the determinate part. 

Let P(S) = the chance a particular trial results in 
a SUCCESS, 

and P/F) =the chance a particular trial results in 
a failure, for both S and F different from 
indeterminacy. 

Let P(I) = the chance a particular trial results in 
an indeterminacy. 

For x € {0,1,2,..,n}, NP (exactly x successes 
among n trials) = (Ty, I, F,), with 
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ye uL  P(S) > CE, PO P(Eyn st 
TN SE P(S)* 
x Gaal (S) 
(n — x)! - 
>) Gaze OO 
th 
n! oNV POPE 
=a eS) 4 k!(n—x—k) 
Similarly: 
z Sant ERDE PE 
iis 2 B=) 7 PO l 2. k! (n — y — k)! sue 
y=0 y=0 k=0 
ytX ytX 


n! 
a 2 z! ccc 94 


È EPOE PE e| 


= E n! I zZ 
E pw 
E a ro EUSES i 
- pe S Py 
z=th+1 
NO P(S)K + P(E)n--* 
42, k!i(n-—z—k)! [| 
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where Cj means combinations of u elements 

taken by groups of v elements: 
"E u! 
C= v! (u — v)! 

and u! is u factorial, u! 2 1:2-:3^ ...- u. 

Also: 

T, = chance of x successes, and n — x failures 
and indeterminacies but such that the number of 
indeterminacies is less than or equal to 
indeterminacy threshold; 

FE, = chance of y successes, with y + x and n— 
yfailures and indeterminacies but such that the 
number of indeterminacies is less than or equal to 
the indeterminacy threshold; 
and I, = chance of z indeterminacies, where z is 
strictly greater than the indeterminacy threshold. 

T, + L, + Fe = (P(S) + PU) + P(F))". 

In most applications, 

P(S) + P(I) + P(F) = 1, 

and this case is called complete probability. 

But for incomplete probability (where there is 
missing information): 

0 € P(S) + P(I) + P(F) <1. 

While in the paraconsistent probability (which 
has contradictory information): 

1 < P(S) + P(I) + P(F) € 3. 


An Example. 
Among the watches sold by a store 80% had a 
digital display and 10% an analog display. There is 
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a number of watches sold for which the storeowner 
has no evidence about their type of display, and he 
asks his manager assistant about them. Not 
knowing the manager’s previous estimations, the 
assistant estimates the unknown type of watches to 
be 20%. 

Let’s consider a neutrosophic random variable 

x = the number of watches among the next 5 
buyers that have an analog display. 

Therefore: 

P(F) = P(digitaldisplay) = 0.8, 
P(S) = P(analogdisplay) = 0.1, 
P(I) = P(indeterminacy) = 0.2. 

We got a paraconsistent neutrosophic probability 
since the information comes from the different 
sources that estimate independently. We have 
contradiction between the estimations of the 
manager and his assistant, because 

0.8 + 0.1 + 0.2 = 1.1 > 1. 

We have a neutrosophic binomial distribution. 

Let’s say the indeterminacy threshold is 2. 

We define the random variable X as follows: 

x = number of watches that have an analog 
display among the next 5 watches to be bought; 


2, (0.2) (0.8)5-*-K 

= k'(5—x—k)' 

where x 20,1,2,3,4,5. 

The chance that exactly 2 watches are analog, i.e. 


NS 
T, = = (0.1): 
k 


x = 21s: 
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Bh ass 
-9 0D 013! iizi 


(0.2)? (0.8)! 
Ee 


z. |5 (0-008) 777 
SD, zl 7 (02) d dor 
m" 


T - (0.8)? A (0.2): (0.8)? 


= 0.0992. 


l - 2 (0.1)(0.8)2-* 


k C-K)! |orz= 3) 


2, (0.1)*(0.8)1-* 


5! 
tq 02* . D: KOW [vos =4)+ 


0 

5! (0.1)*(0.8)1-* 

5, Z 
g2 D KCH! (forz = 5) 

(0.1)? (0.8)? (0.1)1(0.8)? (0.1)? (0.8)? 
0! 2! 1!1! 2!0! 
(0.1) (0.8)!  (0.1)1(0.8)? 
0! 1! 1! 0! 

(0.1)9(0.8)9 
. Dig: |e eee See O 
+ 1: (0.2) | Oron — 0.07232. 
F> can easier be computed (instead of using its 
combinatorial formula) as : 
F, = (P(S) + PU) + P(F)^ - To — I, 
= (0.1 + 0.2 + 0.8)5 — 0.0992 — 0.07232 
= 1.43899. 
If we normalize the vector 
(Tz, I5, F2) = (0.0992, 0.07232, 1.43899) 


= 20- (0.2)? - 


+ 5- (0.2)4- 
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by dividing each vector component by their total 
sum 

0.0992 + 0.07232 + 1.43899 = 1.61051, 

we get (T>, I>, F2) = (0.061595, 0.044905, 0.893500). 

For incomplete and paraconsistent probabilities 
it doesn’t matter if we normalize at the beginning or 
at the end, we'll get the same result. 


Remark. 

Since a third component (the chance of 
indeterminacy) was added to the  binomial 
distribution, the neutrosophic binomial distribution 
actually resembles a summation of classical 
trinomial distribution: 

(pi * ip)" 

where p, and p; are the probabilities that the two 
mutually exclusive events ( F4 and E, ) occur 
respectively, while «» is the chance of getting 
indeterminacy. 

Lets denote by A(a,f,y) the probability of 
obtaining a events E}, B indeterminate events, and 
y events E;, where of course 0 < a,f,y <n, and a + 
B +y =n, as results of n independent trials. 

Of course, as in classical trinomial distribution, 
one has 


n! 
A(a, B, y) = ally! pr iP py 


with n=a+f+y. 
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We need to define what indeterminacy means 
within n trials. Let th be the indeterminacy 
threshold. For th+ 1 or more indeterminacies, we 
consider them as indeterminacy, otherwise we have 
determinacy. 

Then for x € (0, 1,2....,n}, 

NP(exactly x events E, among n trials) = (Ty, Iy, F,), 
where : 

T, = 2. A(x, B,n — x — f) 
Osfsth 
pc > A(a, B,n — a — B) 


th+1<Bsn 
Osasn-th 


F, = 5 A(a, B,n — a — p) 
Osasn, a#x 
Ospsth 
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Neutrosophic Multinomial 
Distribution 


The previous neutrosophic binomial distribution 
is generalized for the case when at each trial there 
are r(=2) possible outcomes and some 
indeterminacy. 

Suppose all possible outcomes are 

Ey, E>, «Ep 
with corresponding chances to occur 
Py, Pz, ..., P. 

and some indeterminacy I with corresponding 
chance to occur 1. 

Then we have the multinomial expansion: 

(P, + Pp +--+ P, +i)” 

for n trials. 

Lets denote similarly by A(a,05,..,a.B) the 
probability of obtaining: exactly a, events E,, a; 


events E, ... , a, events E,, and B indeterminate 
events, 
where 0 € a4, @,...,a,,B <n 


anda, &2 +=: +a, +p =n, 
as results of n independent trials, then 
A(a4, Q5, ... , Ar, B) 


n! 

a a a E 

Ee o A EU, 
Q4! a2! ... ay! f 


Consider the same th as indeterminacy treshold. 
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Let the random variable X; denotes the number 
of times events E; occurs, for any j {1,2,...,r},in n 
independent trials. 

So we have a multivariate distribution. 

Then the neutrosophic probability of obtaining 
exactly x, events F4, x; events Fz, ..., x, events E,, 
in n trials is 


(Tx, X2, Xr? I, xa xp Pe, xs, n) where 


Ty,, N = x: A(X1, X2, Xp, D) 


Ospsth 


I 
M 


A(Q4, &5 ..., Ay, B) 


I, X2, Xr 
th+1sBsn 
Osa sn-th, for je(1,2,..,r) 


Fe, X2, Xr 


b 0sB sth A(04, a5. ..., Ay, B). 


(24,05,..,&*)€(1,2,.., A NGG X2, Xy) 
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Neutrosophic Scatter Plot 


ANeutrosophic Scatter Plot is a picture of 
points (x, y), such that at least a point is not well 
defined. 

For example the point (3, 5) is well defined, while 
the points ([2, 4), 7) or (-6, [O, 1]) or ({—2,—4}, 3) or 
([1, 2], [5, 7]) are imprecise. 

As an example, let’s consider a sample of size 

n = 4 yielding the accompanying data: 


Neutrosophic Observation 





0 1 2 3 4 5 6 x 
2D NEUTROSOPHIC SCATTER PLOT 
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The bivariate neutrosophic scatter plot has, 
besides points as in classical scatter plot, also 
segment of lines or parts of segments of lines, or 
surfaces, or parts of surfaces (geometrical objects of 
dimensions 1 or 2). 

In general, an n-variate neutrosophic scatter 
plot formed by n - 1 independent variables and one 
dependent variable, is composed of geometrical 
objects of dimensions O, 1, 2, ..., or n. 

A neutrosophic dependent or response 
variable is a dependent variable that has some 
indeterminacy. 

Similarly, a neutrosophic independent or 
predictor variable is a variable that has some 
indeterminacy. 

A neutrosophic function 

fn %1,%2, .., Xn) = 0 
is a function depending on variables x4,x;,..., X4 
such that the function has at least one 
indeterminate coefficient, or at least one of its 
independent variables %4,%,...,X, has some 
indeterminate value or is unknown. 

Indeterminate coefficient or indeterminate value 
can be a subset with two or more elements. 

The graph of a neutrosophic function in general 
has a higher dimension than the graph of a 
corresponding classical function (whose indeter- 
minacies have been removed). 
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For examples, the classical function f(x,y) = 0 
represents a curve in the 2D-space, while the 
neutrosophic function fy (x,y) = 0 can be a surface. 

The classical function f(x,y,z) = 0 represents a 
surface in 3D-space, while the neutrosophic 
function fy(xy,z)-0 can represent a bigger 
surface or a solid. 

And in general while a classical function 

f(k1, X2, ..., Xn) = O 
is a geometrical object of dimension d in the n- 
dimensional space, a neutrosophic function 
fn(x1, X2, 66 Xn) -0 
is a bigger (as volume) geometrical object of 
dimension d, or a geometrical object of dimension > 
d. 

The study of a neutrosophic function becomes 
more difficult when, for example, a function's 
coefficient or a value of one of its independent 
variables is completely unknown. 

More classical statistical formulas can be 
neutrosophically extended by replacing the 
operations on crisp numbers with operations on 
sets, that we present below. 


Let's S, and S, be two sets of numbers. 

Then: 

Sı + S5 = (x4 + x5|x4 € S1andx, € Sz}(set addition) 
Si — S, = (x4 — X2|x, € Syandx, € S,}(set substraction) 
S1:S,- (x4: x,|x, € Sqandx; € S }(set multiplication) 

a:S1—S,:a-—ía:x4|x, € S,}(scalar multiplication) 


75 


Florentin Smarandache 





a + S =S,+a={at+x,|x, € S,}(scalar set addition) 
a—S, = {a—x,|x, € S,}(scalar set substraction) 


Sı — a = {x, — a|xı € S,}(scalar set substraction) 
Sy (x, NS 
m {= x1 E S4,X2 E Sz,X2 #0 } (set division) 

2 





X2 
S? = {xf'|x, € S1 }(set power) 








Si (x, E 
Pm {= xı E Sa Æ 0] (set scalar division) 
a 

T-— {= x1 E S1, X1 $ 0} (set scalar division) 
Sy X1 


Ms; = Gala € S4} (root indexnof a set) 


As generalizations wehave: 
m 
2. S; = (072, xi |x; € S;for alli = 1, 2, ..., m]. 
i=1 


Similarly : 
m 


5; = (21 x; |x; € S,for alli = 1,2,..., m}. 


i=1 
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Neutrosophic Regression 


Neutrosophic Regression is the analysis of the 
association between one or more independent 
variables and a dependent variable that are 
expressed by neutrosophic values. This association 
is usually formulated as a neutrosophic equation or 
formula, which enables prediction of future values 
of the dependent variable. 

The graph of this association is, instead of a 
curve in classical statistics, for example: 


a neutrosophic curve (we can call it a „thick curve", 
or ,strip curve"), like: 
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Ls 


since in neutrosophic theory one deals with 
indeterminacy and approximations. 

As in classical statistics, the neutrosophic 
regression may be linear (if the association between 
independent, and dependent variables is linear), or 
non-linear (if the association is non-linear). Among 
the neutrosophic non-linear regressions of second 
degree one mentions the parabolic, elliptic, and 
hyperbolic regressions. 
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Neutrosophic Least- 
Squares Lines 


The Neutrosophic Least-Squares Lines that 

approximates the neutrosophic bivariate data 
Gc, 31), Gc, ya), + Xn Yn) 

has the same formula as in classical statistics 

$-a-ctby 

where the slope 
b= Xxy -[Q:3)02»2/n] 

ye? = [È x)?/n] 
and the y - intercept 
a=y-—bx 

with x the neutrosophic average of x, 

and ythe neutrosophic average of y. 

One uses the circumflex accent ^ above y in order 
to emphasize that ¥ is a prediction of y. 

The only distinction from classical least-square 
line is that in neutrosophic theory we work with sets 
instead of numbers. 

Therefore, into the data, some x's or y’s are 
imprecise, expressed by sets. The consequence is 
that « a » or « b » could result in being sets instead 
of numbers. 

Let's see an example. 


79 


Florentin Smarandache 








Neutrosophic x y x? xy 
Observation 





Neutrosophic 
Predicted Value ji 


d jg — tus fs fhe) fl 3987187959) | (177985, 24.3587) | 
3 1 2 1 ET 7871. 12.2073 zio 2073, 23. 7871 
10,13 36, 49 CX 91 -41.7367, 32.6443 
6 3 5 9 == a 20; 93, 25: 3838 -20. 


Sum 24, 26 38, 44 130, 152 zis = [1862.469 | 468 
t T 





Neutrosophic 
Residual yi, ji 












































TABLE OF A NEUTROSOPHIC SAMPLE 


An example of calculation with sets: 
22e [1,3] - 6 + 2 + (10,13) + 5 + (14, 15] 
=(1+6+2+4+104+5,3+6+2+13+5) 
+ {14,15} = (24, 29) + (14,15) 
= {(24, 29) + 14, (24, 29) + 15} 
= {(38, 43), (39, 44)} = (38, 44). 





Whence: 
_ (215,264) - [(24, 26): cean 
(130, 152) — (2428) 
(215, 264) — puente 
~ 30152) Eee 


(215,264) — (152,191) (24,112) 
~ (130,152) — (96,113) (17,56) 
B e 112 


ze) = (0.42857, 6.58824). 


Since 


y = (4,433333) 
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and 
_ (8844) (3844 _ 
== (==) ~ (6.33333, 7.33333) 
we get 
a = (6.33333, 7.33333) — (0.42857, 6.58824) - 
(4, 4.33333) = (6.33333, 7.33333) — (1.71428, 28.549) = 
(—22.2157, 5.61905). 
Thus, the neutrosophic least-squares line is : 
$ = (—22.2157, 5.61905) + (0.42857, 6.58824)x. 
Lets graph this «line» which actually is a 
geometrical surface between two lines. 
If x = 0,9 = (—22.2157, 5.61905). 
If x = 1,9 = (—22.2157 + 0.42857, 5.61905 + 
6.58824) = (—21.7871, 12.2073). 
We plot these neutrosophic points, which are 
actually segments of line. 
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Neutrosophic Predicted Values are computed as 
$i = (—22.2157, 5.61905) + (0.42857, 6.58824)x;, 
fori = 1,2, ...,6. 

Hence: 
F = (—22.2157, 5.61905) + (0.42857, 6.58824) - 2 
= (—22.2157 + 0.4285 - 2,5.61905 
+ 6.58824 - 2) = (—21.3587, 18.7955). 
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y; = (—22.2157, 5.61905) + (0.42857, 6.58824) - [4,5] 
= (—22.2157 + 0.42857 - 4, 
5.61905 + 6.58824 - 5) 
= (—20.5014, 38.5603). 
Fz = (—22.2157, 5.61905) + (0.42857, 6.58824) - 1 
= (—22.2157 + 0.42857, 5.61905 
+ 6.58824 - 1) = (—21.7871, 12.2073). 
31 = (—22.2157, 5.61905) + (0.42857, 6.58824) - (6,7) 
= (—22.2157 + 0.42857 - 6,5.61905 
+ 6.58824 - 7) = (—19.6443, 51.7367). 
Fz = (—22.2157,5.61905) + (0.42857, 6.58824) - (8) 
= (—22.2157 + 0.42857 - 8,5.61905 
+ 6.58824: 8) = (—18.7871, 58.325). 
Fz = (—22.2157, 5.61905) + (0.42857, 6.58824) - 3 
= (—22.2157 + 0.42857 - 3,5.61905 
+ 6.58824 3) = (—20.93, 25.3838). 
The Neutrosophic Residuals are computed in 
the same way as in classical statistics: 
V1 — Vu V2 — Pz, > Yn — Yn 
where yj are the real values of variable y, 
and 5, are respectively their predicted values. 
The neutrosophic residuals are: 
y — 31 = [1,3]— [(22.2157, 5.61905) 
+ (0.42857, 6.58824) - 2] 
= [1,3] — (21.3587, 18.7955) 
= (1 — 18.7955, 3 — (—21.3587)) 
= (-17.7955, 24.3587). 
Yo — Yo = 6 — [(—22.2157, 5.61905) + (0.42857, 6.58824) 
-[4,5]) = 6 — (—20.5014, 38.5603) 
= (—32.5603, 26.5014) 
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ys — y3 = 2—[(—22.2157, 5.61905) + (0.42857, 6.58824) 
:1] 2 2—(-21.7871, 12.2073) 
= (—10.2073, 23.7871). 

ya — Ya = (10,13) — [(—22.2157,5.61905) + 
(0.42857, 6.58824) - (6,7)] = (10,13) — 

(—19.6443, 51.7367) = (—41.7367,32.6443). 

ys — yz = {14, 15} — [(—22.2157, 5.61905) + 
(0.42857, 6.58824) - 8] = (14,15) — (18.7871,58.325) = 
(—44.325, 33.7871). 
ys — Ve = 5 — [(—22.2157, 5.61905) + (0.42857, 6.58824) 

-3] = 5 — (—20.93, 25.3838) 
= (—20.3838, 25.93). 

It is remarkable to observe that each real value 
of belongs to or it is included in the predicted value 
interval: 

y = [1,3] € (21.3587, 18.3955); 

Yo = 6 € (—20.5014, 38.5603); 

ys = 2 € (—21.7871, 12.2073); 

y4 = (10,13) c (—19.6643, 51.7367); 

ys = (14,15) c (—18.7871, 58.325); 
yg = 5 € (—20.93, 25.3838). 


Deneutrosofications. 


a. Another idea of solving this problem would 
be to transform the neutrosophic data in classical 
data, either taking the midpoint of each set, or the 
average of a discrete set of the form {...}. Or taking 
small neighborhoods centered in the midpoints of 
each set. Or taking the minimum values of the sets 
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and thus constructing multiple classical data. Then 
one computes the least-squares line for each data. 
Afterwards one makes the average of the results, or 
one considers the min/max interval of the results. 

b. Or one transforms the neutrosophic least- 
squares line into a classical least-square line by 
replacing the set representations of the coefficients 
«a» and «b» by their corresponding midpoint, or 
(depending on the application) by other interior 
points of the two sets. In our previous example, 

9 = (— 22.2157, 5.61905) + (0.42857, 6.58824) «x 
becomes 


$ = —8 + 3.5x, 
where —8 is close to the mipoint of 
(—22.2157, 5.61905), 
and 3.5 is close to the midpoint of 


(0.42857, 6.58824). 

c. One could take the midpoints of the 
neutrosophic predicted values neutrosophic 
residuals, or initial neutrosphic data; or smaller 
neighborhoods centered in the midpoints; or min 
values and max values separately and obtaining 
multiple classical data and calculating the needed 
statistical characteristic for each of them, then 
averaging the results. 

Lets compute the midpoints of neutrosophic 
predicted values and neutrosophic residuals: 
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Neutrosophic Neutrosophic 
Predicted Value Residual Midpoint 
Midpoint 
-1.2816 3.2801 
9.0295 -3.0295 
-4.7899 6.7899 
16.0467 -4.5462 
19.7690 -5.2690 
2.2269 2.7731 
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Neutrosophic Coefficient 
of Determination 


We compute the Neutrosophic Residual Sum of 
Squares, denoted by NSSResid, given by: 


NSSResid = 2,0 -5» = >” -a) y- b) xy 


and the Neutrosophic Total Sum of Squares, 
denoted by 


NSSTo = 2,0 =¥)? = 2» — Qu» 


n 

The Neutrosophic Coefficient of 
Determination, denoted by rz, is : 
NSSResid 


TN = 1- NSSTo ° 
and represents the proportion of variation in y, 
when considering a linear relationship between 
variablesx and y. 
NSSResid = 3.2801? + (3.0295)? + 6.7899? 
+ (—4.5462)? + (—5.2690)? + (2.7731)? 


= 122.16; 
2 38,44)? 
NSSTo — > y? — en = (362,468) — oo 
= (362, 468) sls 
7 f 6° 6 


= (362, 468) — (40.1111, 53.7778) 
= (362 — 53.7778, 468 — 40.1111) 
= (308.222, 427.889). 
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Whence 
i ud 122-16 Aes (S -16 122.16 ) 
N (308.222, 327.889) 327.889' 308.222 

— 1 — (0.3726,0.3963) 
= (1 — 0.3963,1 — 0.3726) 
= (0.6037, 0.6274). 

So between 60.37% and 62.74% of the sample 
variation is explained by the neutrosophic 
approximate linear relationship between x and y. 

The Neutrosophic Correlation Coefficient or the 
product moment neutrosophic coefficient TN 
(extension of Pearson's correlation coefficient from 
crisp data to neutrosophic data), has the same 
formula as in classical statistics, but we work with 
sets instead of numbers: 


Nosy HD AY 





WU nie- Oxy ney? —- Oy 
or 
- Sxy 
WRIT 


where $,,is the neutrosophic covariance of x — 
and y values, and $,,$, are the neutrosophic 
sample standard deviations. 

Let's consider the example from the previous 
Table of Neutrosophic Sample of size 6. 
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TN 
6 - (215, 264) — (24, 26) - (38, 44) 
46: (130,152) — (24, 26)? - [6 - (362, 368) — (38, 44)?] 
(6 - 215,6 : 264) — (24: 38,26: 44) 
[(6 : 130, 6 - 152) — (242, 26?)] - [(6 - 362,6 - 468) — (382, 44?)] 
(1290, 1584) — (912, 1144) 


[(780, 912) — (576, 676)] - [(2172, 2808) — (1444, 1936)] 
(1290 — 1144, 1584 — 912) 


4 (780 — 676,912 — 576) - (2172 — 1936, 2808 — 1444) 


] (146,672) " (146,672) 
/(104,336)-(236,1364) (104 - 336,336 1364 
(146,672) (146,672) 


— (V34944,/458304) (186.933, 676.982) 
146 672 

7 a T 

In general ry is a subset of the interval [—1, 1]. If 
ry is a subset of [0,1] then the points (xj, yi) for i = 
1,2, ..., n, lie approximatively near a straight line of 
positive slope, while when ryis a subset centered 
or almost centered at O (or ry is nearly half in [O, 1] 
and nearly half in [—1,0] then their is virtually no 
linear approximation but their may be a non-linear 
association between the points. 


) = (0.2157,3.5949) = (0.2157, 1]. 


Neutrosophic Random Numbers is a sequence 
of numbers and indeterminacies occurring at 
random with equal probability. 

The occurrence of a number or indeterminacy is 
not a guide to the numbers or indeterminacies 
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that follow it, nor is it predicted from the numbers 
or indeterminacies that precede it. 

Using eleven balls numbered O to 9 and another 
one ball that has its number erased (which one 
cannot read, that we note by J), then repeatedly 
withdrawing a ball and putting it back to the 
container. 

We randomly generate the sequence: 

2,9,9,1,0,7,6,2,1,1,/,8..., 

where I= indeterminacy. 

The computers can be enabled to generate 
neutrosophic random numbers using the same 
classical algorithms as for classical random 
numbers, but adding one or more states of 
indeterminacies with an equal chance of occurring 
each of them. 

As a generalization we proposed the 
Neutrosophic Weighted Random Numbers, 
where each number x; has a different chance p; to 
occur, and each indeterminacy J; has a different 
chance r; to occur. 

There are also cases when the numbers have to 
be in a given set; for example, each number should 
have k digits. 
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A Neutrosophic Normal 
Distribution 


A Neutrosophic Normal Distribution of a 
continuous variable X is a classical normal 
distribution of x, but such that its mean ų or its 
standard deviation o (or varianceo?), or both, are 
imprecise. 

For example, p, or o, or both can be set(s) with 
two or more elements. The most common such 
distributions are when p, o, or both are intervals. 

The neutrosophic frequency function formula is 
the same, except, as explained in the introduction, 
replacing p by pn and o by ox: 

1 x — Uy)? 
Xy Ny (ny, oi) = su P (- EM 
where Xy actually means that variable X may be 
neutrosophic (i.e. having some indeterminacy), and 
similarly Ny (..) meaning that the normal 
distribution N(. ..) may be neutrosophic (i.e. having 
some indeterminacy). 

Instead of one bell-shaped curve, we may have 
two or more bell-shaped curves that have common 
and uncommon regions between them and are 
above the x-axis. Each one is symmetric with 
respect to the vertical line passing through the 
mean (x = p). 
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As a first neutrosophic example for normal 
distribution, let's consider a normal distribution 
with p = 15 and o = [2, 3]. Thus the standard 
deviation is indeterminate. 


Ny, (15,2?) 


Ny, (15,0?),2 <0 <3 


Nw, (15, 32) 


Within one standard deviation of the mean 
translates in this first example by: 
pio =15+[2,3] = [15 - 3,15 + 3] = [12,18], 
or approximately 68% of values lie between 
x € [12,18]. 
Within two standard deviations of the mean 
translates by: 
pt2o=1542-[2,3] = 15 € [4,6] = [15 — 6,15 + 6] 
= [9,21], 
or approximately 95,4% of values lie between 
x € [9,21]. 
We could also compute the last interval as: 
[12,18] + c = [12,18] + [2,3] = [12 — 3, 18 + 3] 
= [9,21]. 
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For three standard deviations: 
pt3o = 15 3- [2,3] =15+[6,9] = [15 — 9,15 + 9] 
= [6,24], 
or we could compute it as 
[9,21] + [2,3] = [9 — 3,21 + 3] = [6, 24], 
and approximately 97,7% of values lie between 
x € [6,24]. 

The area between the lowest and the highest 
curve for each portion represents the burden 
(indeterminacy) of the graph. 

The neutrosophic normal distribution can be 
regarded as a bell-shape curve with heavy margins. 

A random variable X that has a neutrosophic 
normal distribution is called a  neutrosophic 
normalvariable. 

A second neutrosophic examplefor normal 
distributionwhere u = [15,17] and o = 2, hence now 
p is indeterminate. 


* 
Y x 
x 
æ, 
D 








o 


E Ss era 





H 


10 


Similar discussion for the second example: 
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Within one standard deviation, i.e. 

u +o = [15,17] +2 = [15 — 2,17 + 2] = [13,19], 
approximately 68% of values lie between x € 
[13, 19]. 
Within two standard deviations, i.e. 
Uu t2e = [15,17] £2:2 = [15,17] £4 = [15 2 4 17 + 4] 
= [11,21], 
or computed as 
[13,19] +o = [13,19] +2 = [13 — 2,19 + 2] = [11,21]. 
And within three standard deviations, i.e. 
ut3o = [15,117] +3: 2 = [15,17] + 6 = [15 — 6,17 + 6] 
= [9,23], 
or computed as 
[11,21] + 2 = [11 — 2,21 + 2] +2 = [9,23], 
approximately 97.7% of values lie between 
x € [9,23]. 

A third neutrosophic example of normal 
distribution with u = [15,17] and ø = [2,3], hence 
double indeterminacy, combines the previous 
second graph with the first one. 

Of course, the vagueness becomes wider! 

With u = [15,17] and o = [2,3], we get: 

Within one standard deviation of the mean, i.e. 

uto = [15,17] + [2,3] = [15 - 3,17 + 3] = [12, 20], 
approximately 68% of values lie between 
x € [12, 20]. 
Within two standard deviations of the mean, i.e. 
uU x26 = [15,17] +2: [2,3] = [15,17] + [4, 6] 
= [15 — 6,17 + 6] = [9,23], 
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or computed as [12, 20] + [2,3] = [12 — 3,20 + 
3] = [9, 23], 

approximately 95.496 of values lie between x € 
[9, 23]. 

And within three standard deviations of the 
mean, i.e. 

uw +36 = [15,17] + 3: [2,3] = [15,17] + [6,9] 
= [15 — 9,17 + 9] = [6, 26], 
or computed as [9,23] + [2,3] = [9 — 3,23 + 3] = 
[6, 26], 
approximately 97.796 of values lie between 
x € [6,26]. 


Neutrosophication of Other Distributions. 


In the same way, replacing one or more 
distribution parameters by a set, we can extend the 
classical distributions, such as: standard normal 
distribution, bivariate normal distribution, uniform 
distribution, sampling distribution, geometric 
distribution, hypergeometric distribution, Poisson 
distribution, chi-squared distribution, exponential 
distribution, frequency distribution, Pareto 
distribution, t-distribution, etc. to their 
corresponding neutrosophic versions. 

The set replacing a crisp parameter may have two 
or more elements, or may be empty (the last case 
meaning that the parameter is unknown). 
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A Neutrosophic 
Hypothesis 


A Neutrosophic Hypothesis is a statement about 
the neutrosophic values of a single or several 
population characteristics. 

The distinction between the classical (statistics) 
hypothesis and neutrosophic hypothesis is that in 
the neutrosophic statistics the variables that 
describe the population characteristics are 
neutrosophic (i.e. they have some indeterminate 
values, or several unknown values, or an inexact 
number of terms if the variable is discrete), or for 
the values that we compare at least one of the 
population characteristics is neutrosophic (i.e. 
indeterminate or unclear or vague value). 

Similarly to the classical statistics, a 
Neutrosophic Null Hypothesis, denoted by NHo, is 
the statement that is initially assumed to be true. 
While the Neutrosophic Alternative Hypothesis, 
denoted by NHa, is the other hypothesis. 

In carrying out a test of NHo versus NH, there are 
two possible conclusions: reject NHo (if sample 
evidence suggest strongly that NHo is false), or fail 
to reject NHo (if the sample does not support string 
evidence against NHo). 

Examples: 

NHo: y € [90,100] 
NHa: p < 90 
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NH: > 100 
NH,:p € [90, 100], 
where p represents the classical average IQ of all 
children born since 1st January 2001. 


NHg: x = 0.2 or 0.3 
NH,:n < 0.2 
NHg:tt > 0.3 
NH: n € (0.2, 0.3) 
NHa:q € {0.2, 0.3}, 
where m represents the classical proportion of all 
Ford cars that need repair while under first year of 
warranty. 


NHo: p < 0.1 orp > 0.9 
NHa:p = 0.1 
NH,:p = 0.9 
NH: p > 0.1 and p < 0.9 
NH,:p € [0.1, 0.9], 
where p represents the classical proportion of 
outliers in a human population with respect to their 
height, i.e. percentage of people whose height is less 
than 150 cm, or percentage of people whose height 
is greater than 190 cm. 
Neutrosophic Outliers are noticeably unusual 
values in the neutrosophic data; they can be crisp 
values or neutrosophic values. 


NHo: [B Pmax! > [0.45,0.55], 
which is equivalent to 
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Hmin > 045 
and 
Hmax 7? 0-55 
where u represents a neutrosophic percentage 
average of all electronic devices that get morally 
depreciated after three years from their fabrication; 
[Hnn max! is a  neutrosophic value (rough 
approximation). 
NHa? Win = 045 
NHa! Bag = 9-55 
NHa: Vipin < 0-45 
NH3: Vinay < 0-55 
NHa: Vin < 9-45 OT max < 0.45. 


NHo: u = 7.0 
NHa: p < 7.0 
NHa: u > 7.0 
NH: u # 7.0 
A manufacturing plant made an approximate 
survey of its selling, survey done by two 
independent observes on different samples of same 
size. Their findings are close, yet different. The 
owner of the manufacturing plant decided to put 
both results together, taking for each period the 
[min, max] or [inf, sup] interval, in order to see the 
fluctuation of sales. The variable x that describes 
the survey is thus a neutrosophic one: 
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Period Sold Quantity (in thousands) 
[4, 6] 

2001 

2002 [7, 8] 

2003 5.9) 0F 6.0 

2004 (8.0, 8.8) 

2005 T9 


The null hypothesis that the average annual 
selling p = 7.0 is in the classical style, but the 
variable x that p is referring to is neutrosophic. 

So we still have a neutrosophic hypothesis. 


Neutrosophic Hypothesis Testing Errors. 


A census of a large population is hard or even 
impossible to due. That’s why we have to use 
samples. The inference we are making from a 
neutrosophic sample characteristic to a population 
characteristic is subject to error. 

Similarly to classical statistics, there are two 
types of errors: 

1. Neutrosophic Type I Error, which is the error 
of rejecting NHo when NHo is true. 

2. Neutrosophic Type II Error, which is the 
opposite of the previous error, i.e. the error of not 
rejecting NHo when NHo is false. 
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No matter what test we do, there is some chance 
that a neutrosophic type I error will be made, and 
there is some chance that a neutrosophic type II 
error will be made too. 

For example, rejecting the hypothesis Ho: p = 7.0 
when it is true in one of the previous examples, 
would determine the owner of the manufacturing 
plant to take additional adjustments and spending 
money when not really needed. 

While accepting Ho: p = 7.0 when it is false, will 
damage the future selling. 

Probabilities of neutrosophic type I error and type 
II error are denoted by an (level of significance) and 
respectively By. 

Dealing with neutrosophic probabilities, an and 
Bx can be subsets of the interval [O, 1]. The ideal test 
procedure would havean= Buz 0, or anand fuas tiny 
intervals near zero. 

For example, ifan= [0.07,0.10]in a test procedure, 
done with different samples, over and over, a true 
hypothesis Ho is rejected about 7, 8, 9, or 10 times 
in a hundred. 

If By — [0.07,0.10], then a false hypothesis Ho is 
accepted about 7-10 times in a hundred. 


Example. 

A car manufacturer pretends that between 80% 
and 90% of its car need no repair during the first 2 
years of driving. In order to check the claim, a 
consumer agency obtains a random sample of 50 


100 


INTRODUCTION TO NEUTROSOPHIC STATISTICS 





purchasers and investigate them whether or not 
their cars needed repair during the first 2 years of 
driving. Let p denote the sample proportion of 
responses that indicate no repair, and let m denote 
the true proportion of no repairs (called successes). 
The appropriate neutrosophic hypotheses are: 
N Ho: T € [0.8, 0.9] versus NHg:7 < 0.8 

in order to check if the sample evidence suggests 
that m < 0.9. 

Neutrosophic Type I Error is to consider the car 
manufacturer's claim fallacious (i.e. m < 0.8) while 
in fact it is correct. 

And Neutrosophic Type II Error if the consumer 
agency fails to detect the manufacturer's incorrect 
claim. 

For avoiding serious consequences the consumer 
agency decides a type I error probability of 
[0.01,0.05] but no larger can be tolerated. So 

a —[0.01,0.05] is used for developing a test 
procedure. 

We recall, from classical statistics, that a 
classical standard normal distribution of a random 
variable z, is a normal distribution with the mean 
value 


u-0 
and standard deviation 
g — 1. 


Its corresponding curve is called standard 
normal curve or z curve. 
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A z critical value captures the lower-tail or 
upper-tail area, or the central area under the z 


curve. 
The table of the most used z critical values in 


classical statistics: 


Critical Area to the Area to the Area 
value, z right of z left of -z between -z 
andz 
1.28 PO BIO .80 
1.645 .05 .05 .90 
1.96 1025 1025 95 
2.33 .01 Oil .98 
2.58 .005 .005 .99 
3.09 .001 .001 .998 
9:20 .0005 .0005 .999 


A normally distributed random variable x can 
bestandardized as 





where u — x s mean value, 
and o = x's standard distribution. 
If the neutrosophic null hypothesis about 
variable x is: 
NHg:p € [a,b], 
where [a,b], with a<b, is the hypothesized 
interval, then the neutrosophic test statistic is: 
X — [a.b] 
= oi 
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where x is the sample mean, 
sis the sample standard deviation, 
and nis the sample size, with n > 30. 


Variable z has approximately a neutrosophic 
standard normal distribution. 

In neutrosophic statitics, x, s and even n can be 
sets (not necessarily crisp numbers). 


Alternative Hypotheses. 

Ha:u > b; Reject Hy if minz > z critical value 
(upper-tailed test); 

Hg:u<a; Reject Hy if maxz < —z critical value 
(lower-tailed test); 

Ha: u é [a,b]; Reject Ho if: either min z > z critical 
value, or maxz < —zcritical value (two-tailed test). 


Example. 

Let’s consider the exam-anxiety scores for a 
sample of an American College students were the 
following: 

n = 64,x = [48.0, 50.0], and s = 25. 

Then u = true mean exam-anxiety. 

Ho: u € [40.0, 41.0] 
Ha: 41.0. 
The neutrosophic test statistics is: 
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[48.0, 50.0] — [40.0, 41.0] [48.0 — 41.0,50.0 — 40.0] 


25/V64 j 25/8 
[7.0,10.0] 8-[7.0,10.0] [56.0, 80.0] 
er 387g. uu — Qo, - m: DES T 
56.0 80.0 


For a=0.10 the corresponding one-tailed z 
critical value from the previous table is 1.28. Hence 
Hg is rejected because z = [2.24, 3.20] > 1.28. 

In conclusion, the mean exam-anxiety score is 
higher than 41.0. 
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The Neutrosophic Level of 
Significance 


The Neutrosophic Level of Significance a may 
be a set, not necessarily a crisp number as in 
classical statistics. 

For example, a, = [0.01,0.10]is a neutrosophic 
level of significance a, where a varies in the interval 
[0.01, 0.10]. 

A Neutrosophic P-Value is defined in the same 
way as in classical statistics: the smallest level of 
significance at which a null hypothesis Hy can be 
rejected. 

The distinction between classical P-value and 
neutrosophic P-value is that the neutrosophic P- 
value is not a crisp number as in classical statistics, 
but a set (in many applications it is an interval). 

Neutrosophic  P-Value = P(z > zcritical value, 
whenHjistrue), where  P() means classical 
probability calculated assuming that Họ is true, 
probability of observing a test statistic value being 
more extreme than is was actually obtained. 

Suppose one has calculated the neutrosophic P- 
value at the particular level of significance a, where 
a is a crisp positive number. 

1. IfmaxíneutrosophicP — value) € a, then reject 
Hg at level a. 

2. Ifmin(neutrosophicP — value) » a, then do not 
reject Ho at level a. 
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3. If min{neutrosophicP — value}<a< 
maxíneutrosophicP — value} then there is an 
indeterminacy. Thus 

a — min(neutrosophicP — value} 
maxíneutrosophicP — value) — min(neutrosophicP — value) 

is the chance of rejecting Ho at level a, 

and 

max{neutrosophicP — value) — a 
maxíneutrosophicP — value) — min(neutrosophicP — value) 

is the chance of not rejecting Hy at level a. 

Let ay be a set. 

4. If max{neutrosophicP — value) € min{ay}, then 
reject Hy at level ay. 

5. If min{neutrosophicP — value} > max(ay), then 
do not reject Ho at level ay. 

6. If the two sets, those of the neutrosophic P- 
value and of the neutrosophic level of significance 
ay intersect, one has indeterminacy. And one can 
compute the chance of rejecting Hy at level ay, and 
the chance of not rejecting Hy at level ay. 

In classical statistics, the P-value is computed 
considering theTable of Standard Normal 
Probabilities. 

a. P-value is the area under the z curve to the 
right of computed z, for Upper-tailed z test. 

b. P-value is the area under the z-curve to the 
left of computed z, forLower-tailed z test. 

c. P-value is twice the area captured in the tail 
corresponding to the computed z, for Two-tailed z 
test. 
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Let’s insert from the classical statistics the 
Standard Normal Cumulative Probability Table [for 
positive z-values only, since this is needed in our 
below example]: 


Standard Normal Cumulative Probability Table | à 


Cumulative probabilities for POSITIVE z-values are shown in the following table: É 


0.04 0.05 0.06 0.07 0.08 





In the previous example, 
Hg: u e[40.0, 41.0] versus Ha: u > 41.0, 
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we found the neutrosophic z = [2.24,3.20]. We have 
an Upper-tailed z test. 

From the above Table of Standard Normal 
Probabilities, the area under the z curve to the right 
ofz, = 2.24 is 1 — 0.9875 = 0.0125 

while for 

Z = 3.20 is 1 — 0.9993 = 0.0007. 

Thus, the neutrosophic 

P — value = [0.0007, 0.0125]. 

At the level of significance a, = 0.10, reject Ho 

since 
max [0.0007, 0.0125] = 0.0125 < 0.10. 

At the level of significance a2 = 0.0005, do not 

reject Hy since 
max [0.0007, 0.0125] = 0.0125 > 0.0005. 

At the level of significance «4 — 0.01, one has 
indeterminacy since 

0.01 € [0.0007, 0.0125]; therefore: 

chance of rejecting Ho at level a3 = 0.01 is 

0.01 — 0.0007 0.0093 


0.0125 — 0.0007 0.0118 
and chance of not rejecting Hp at level a3 = 0.01 


= 79% 
is 


0.0125 — 0.01 0.0025 2 
0.0125 —0.07 0.0118 ^ ^" 
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The Neutrosophic 
Confidence Interval 


The Neutrosophic Confidence Interval for a 
population characteristics is defined, similarly to 
the classical statistics, as an interval of plausible 
neutrosophic values of the characteristic. 

The neutrosophic value of the characteristic is 
captured inside the interval with a chosen degree of 
confidence. 

A confidence level is associated with each 
neutrosophic confidence interval, as in classical 
statistics. It tells us how much confidence we have 
in procedure used in constructing the neutrosophic 
confidence interval. 

The classical formulas for the confidence interval 
are extended from crisp variables to neutrosophic 
variables (i.e. variables whose values are sets): 

1. When the neutrosophic value of the 
population standard deviation o is known, the 
Large-Sample Neutrosophic Confidence Interval 
for the Population Mean p is: 


o 

X + (zcritical value) - Ta 

where x is the large-sample neutrosophic mean, 
and n is the neutrosophic size of the large-sample. 

Thereforex, o, and/or n may be sets instead of 


crisp numbers. 
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2. When the neutrosophic value of the 
population standard deviation o is unknown (as in 
most practical applications), and the sample size 
exceeds 30, one uses the sample standard deviation 
s instead of o for computing the Neutrosophic 
Confidence Interval for the Population Mean p: 

X + (zcritical value) : mm 

For both formulas, the z critical value 1.645 
corresponds to the confidence level of 9096, the z 
critical value 1.96 corresponds to the confidence 
level of 9596, and the z critical value 2,58 
corresponds to the confidence level of 99%, 
similarly as in classical statistics. 

The confidence level of, for example, 9096 does 
not refer to the chance that the population mean p 
is captured in an interval, but to the percentage of 
all possible successful samples (i.e. samples for 
which p is included in the confidence interval). 

An Example. 

Many individuals partially loose vision because 
of exposure to dust. 

On a study involving 60 people (a sample), that 
were constantly exposed to dust to their 
construction work places, in average they lost 
1896-2096 of their vision accuracy, with a sample 
standard deviation of 496-5906. 

The study investigator wishes a 9096 confidence 
interval for p. Hence: 

x = [18,20] 
zcritical value = 1.645 
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s — [4,5] 
n — 60. 
Therefore, the neutrosophic confidence interval 
for the population mean p is: 
[4,5] 


[18, 20] + (1.645) T 


EPET 1.645(4) 1.645(5) 


v60 ' v60 
= [18,20] + [0.85, 1.06]. 
Let’s split into two parts: 
[18, 20] + [0.85, 1.06] = [18 + 0.85, 20 + 1.06] 
= [18.85, 21.06], 
and 
[18, 20] — [0.85, 1.06] = [18 — 1.06, 20 — 0.85] 
= [16.94, 19.15]. 

Combining these two cases we get the 

neutrosophic confidence interval: 
[16.94, 21.06]. 

The Neutrosophic Sample Size to estimate, 
within the amount B, with c% confidence, of the 
population mean p is: 

(zcritical value)-o 
mp 

where z critical value should correspond to the 
c?o confidence, 

o is the population standard variation, 

and ny is the resulting neutrosophic sample size, 
hence ny may be a set (especially an interval). 

For surety, we can take the sample size as 
[maxíny], where | | means superior integer part. 

Let's see an Example. 
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The business department wishes to estimate the 
annual cost of office supplies for faculty at the 
University of New Mexico to be within $40 of the 
true population mean. The business department 
wants a 95% confidence in their result accuracy. 

How large should the sample be? 

Because o is not known, it can be approximated 
as 

range 

Tho 

as in classical statistics. 

Range is the difference between the highest and 
lowest costs. 

The amount spent on office supplies varied 
between $500-$550 to $100-$150. Then 

[500,550] — [100,150] | [500 — 150,550 — 100] 


Se e 
~ = 


4 4 
[350,450] 350 450 
4. 7 e] 
= [87.50, 137.50]. 
Further, B = 40, z critical value is 1.96, and: 
1.96[87.50, 137.50]]’  [1.96(87.50) 1.96(137.50)]? 
207 40 | = | 40  ' . 40 
= [4.2875,6.7375]? = [4.2875?,6.7375?] 
=~ [18.38, 45.39]. 


Now 
[max[18.38, 45.39]] = [45.39] = 46. 
Therefore the sample size should be 46. 


112 


INTRODUCTION TO NEUTROSOPHIC STATISTICS 





Large-Sample 
Neutrosophic Confidence 
Interval for the 
Population Proportion 


Using the classical statistics one can define (in 
the same way) the Large-Sample Neutrosophic 
Confidence Interval for the Population 
Proportion n: 


p(1- p) 


p + (zcritical value) : " 


for the case when minínp) 2 5 and minín- (1 — 
p) 25, 

where 

p = sample proportion = number of sample 
individuals that possess the property of interest 
divided by sample’s size; 

n = sample’s size; 


It = population proportion = 
number of population individuals that possess the property of interest 


total number of population individuals 


with the distinction from the classical statistics 


that in neutrosophic statistics the parameters p and 
n may be setsinstead of crisp numbers, and the z 
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critical value may be a set as well(for example it may 
be [1.645, 1.96], i.e. confidence level of [90, 95]%). 
The neutrosophic sample statistics p, for min(n) 
large enough, has a neutrosophic sampling 
distribution (normal curve) that approximates the 
population mean m and its standard deviation 


(1-7) 
Lx. 


Let's see an Example. 

A survey on a sample of 200 - 220 consumers is 
done at a car dealer asking the following question: 
^Would you be willing to trade in your old car when 
buying a new car?" The number of yes's was 150. 
The confidence level should be 99%. If n denotes the 
proportion of all consumers who would trade in 
their old cars, one may consider p a point estimate 
for r: 

150 150 150 

P = 1300, 201,...,220) 1220'200 

The sample’s size {200,201,...,220} means that 
the surveyer was not sure about 20 people if they 
were or not custumers of this car dealer. So, the 
sample’s size is indeterminate (approximated by the 
set (200,201, ...,220}), 

z critical value = 2.58. 

min{np} = min((200, 201, ..., 220} - [0.68, 0.75]} 
= 200(0.68) = 136 » 5; 


| = [0.68, 0.75]. 
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min(n(1 — p)} = min{{200, 201, ...220) 
- (1 — [0.68, 0.75])} 
= 200 min([1 — 0.75, 1 — 0.68]) 
= 200: min([0.25,0.32]) = 200(0.25) 
=50>5. 
The  large-sample  neutrosophic confidence 
interval for n is: 






[0.68, 0.75] - (1 — [0.68,0.75]) 

(200,201, ..., 220} 

= [0.68, 0.75] + 2.58 

[0.68, 0.75] - [0.25, 0.32] 
(200, 201, ..., 220} 

= [0.68, 0.75] + 2.58 


[peram 275039) 


(0.68, 0.75] + 2.58: 


220 ^" 200 
= [0.68, 0.75] + 2.58 
: 4[0.000773, 0.001200] 
= [0.68,0.75] + 2.58 
: V0.000773, 0.001200 
= [0.68,0.75] + 2.58 
- [0.027803, 0.034641] 
= [0.68,0.75] + [0.071732, 0.089374]. 
Split it into two parts: 
[0.68, 0.75] + [0.71732, 0.089374] 


= [0.751732, 0.839374], 
and 
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[0.68, 0.75] — [0.071732, 0.089374] 
= [0.68 — 0.089374, 0.75 — 0.071732] 
= [0.590626, 0.678268]. 

Combining both results in a conservative mode, 
we get: 

[0.590626, 0.839374]. 

The formula for choosing the neutrosophic 
sample size is the same as in classical statistics, but 
using sets instead of crisp numbers: 
zcritical any 


B 
where B = the specific error bound. 


If m cannot be estimated using prior neutrosophic 
information, one uses mn = 0.5 which gives a 


n-220-2.| 


conservatively large sample value (i.e. a larger n 
than any other value of rt would do). 


116 


INTRODUCTION TO NEUTROSOPHIC STATISTICS 





The Neutrosophic Central 
Limit Theorem 


The Neutrosophic Central Limit Theorem, which 
is an extension of the classical Central Limit 
Theorem, can be safely applied if min{n} exceeds 30, 
where n is the neutrosophic sample size (i.e. n may 
be a set). 

The Neutrosophic Central Limit Theorem states 
that the neutrosophic sampling distribution of x si 
approximated by a neutrosophic normal curve 
when min{n} is sufficiently large, no matter how is 
the population distribution. 

Of course, if the population distribution is 
normal, then min{n} may be smaller than 30, and 
the neutrosophic sampling distribution of x is 
normal too for any neutrosophic sample size n. But, 
if the population distribution is not normal, then 
min{n} should be greater than 30, and the 
neutrosophic sampling distribution of x is only an 
approximation to the normal curve: the larger is 
min{n}, the better approach. 


The last result has enabled the neutrosophic 
statisticians in order to infer a population mean, to 
develop large sample neutrosophic procedures even 
when one deals with an unknown shape of the 
population distribution. 
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Using similar notations: 

n = random neutrosophic sample size; 

x = neutrosophic mean of the sample size; 

u = population mean; 

o = population standard deviation; 

lz = neutrosophic mean of the x distribution; 

and 

0; = neutrosophic standard deviation of the x 
distribution; 

one has, as in classical statistics: 

Hz = H, 


o 
and oz = 


T 

The neutrosophic central limit theorem does not 
apply, as in classical statistics, when min{n} is 
small and the shape of the population distribution 
is unknown. 


Let’s introduce the Small-Sample Neutrosophic 
t Confidence Interval for the Mean of the Normal 
Population, which is just a neutrosophication of 
the classical one-sample t confidence interval for 
the population mean p: 


S 
X + (tcritical values) : — 
T a 


where similarly: 
x = neutrosophic sample mean; 
s =neutrosophic sample standard deviation; 
n = neutrosophic sample size; 
and 
tcritical value is based on 
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min{n} — 1 degrees of freedom (df). 
x,s,andn may be sets instead of crisp numbers. 
For small min{n}, the neutrosophic t confidence 
interval for the population mean p is appropriate 
when the population distribution is normal or 
approximately normal. Otherwise, another method 
should be employed. 


The neutrosophic t distribution is more spread 
out, of course, than the neutrosophic standard 
normal (z) curve, because the use of s, instead of 
population deviation o, produces extra variability. 


The neutrosophic t distributions are 
distinguished from one another by the degree of 
freedom, which can be a positive integer greater 
than Or equal to 
1, or a set of positive integers greater than or equal to 1, 
for example: 

{n,n+1,...,n +m}. 

The higher is min(n], the closer the neutrosophic 
t distribution is to the neutrosophic z curve. For 
min{n} > 120 one may use the z critical values. The 
neutrosophic tcurve, for a fixed number of degrees 
of freedom, is in general bell-shaped and centered 
at zero in neutrosophic style way. 


An Example. 
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A small random sample of 18 workers, at the Rail 
Road, was investigated regarding the weights these 
workers are able to lift in their work place. The 
neutrosophic sample average found was x between 
8 kg and 10 kg, with a standard deviation s between 
3-4 kg. 

Let's say a confidence level of 9596 is required for 
capturing the population mean p.Thus: 

X = [8, 10](an interval) 
s = [3,4](an interval) 
n= 18, 
hence a small sample size, which requires a 
neutrosophic t critical value based on 18—1- 
17 df. 

From the below classical statisticsTable of t 
Critical Values,we find out that for 9596 confidence 
level and 17 df, the corresponding 

tcritical value = 2.11. 
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t Table 

cum. prob} ta fs tfa ts tẹ fs fæ tẹ teas fm lus 
oneta 050 0.25 0.20 015 010 005 000 001 0005 0.001 00005 
two-ails 


di 









2 000 0816 1001 130 186 290 4X9 695 995 237 31590 
3 000 076 098 120 41638 2383 318 454 58 10215 1294 
4 
E 


2704 339 351 
2660 322 3460 
1990 2699 319 3416 
2626 314 339 
1962 2581 308 3230 


80% 9 95% 90% 998% 99.9% 
Confidence Level 
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Apply the previous formula: 








E+ (teritical value) -~ = [8,10] + 2.11 4) 

X T critical Value m 5 We Lie Vis 
N 2.11(3) 2.11(4) 
= 18.1012 T 


= [8, 10] + [1.492, 1.989]. 
Split the calculation into two possibilities: 
[8, 10] + [1.492, 1.989] = [8 + 1.492, 10 + 1.989] 
= [9.492, 11.989], 
and 
[8, 10] — [1.492, 1.989] = [8 — 1.989, 10 — 1.492] 
= [6.011, 8.508]. 

Now we combine both results in a conservative 
way, and we get the neutrosophic t confidence 
interval for the population average of weight lifting: 
[6.011, 11.989] kg. 
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Neutrosophic Statistics means statistical analysis of 
population or sample that has indeterminate (imprecise, 
ambiguous, vague, incomplete, unknown) data. For 
example, the population or sample size might not be exactly 
determinate because of some individuals that partially 
belong to the population or sample, and partially they do 
not belong, or individuals whose appurtenance is 
completely unknown. Also, there are population or sample 
individuals whose data could be indeterminate. 


In this book, we develop the 1995 notion of 
neutrosophic statistics. We present various practical 
examples. It is possible to define the neutrosophic statistics 
in many ways, because there are various types of 
indeterminacies, depending on the problem to solve. 
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